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STATEMENT AS TO FEDERALLY SPONSORED RESEARCH 

5 

This invention was made with Government support under Grant No. CA433 1 8 
and CA54396, awarded by the National Cancer Institute and Grant No. CA433 1 8, a 
Colon Cancer Spore Grant. The government may have certain rights in the invention. 

10 FIELD OF THE INVENTION 

The present invention relates generally to the regulation of gene expression and 
more specifically to a method of determining the DNA methylation status of CpG sites in 
a given locus and correlating the methylation status with the presence of a cell 
15 proliferative disorder. 

BACKGROUND OF THE INVENTION 

DNA methylases transfer methyl groups from the universal methyl donor S- 
20 adenosyl methionine to specific sites on the DNA. Several biological functions have 
been attributed to the methylated bases in DNA. The most established biological 
function for methylated DNA is the protection of DNA from digestion by cognate 
restriction enzymes. The restriction modification phenomenon has, so far, been observed 
only in bacteria. Mammalian cells, however, possess a different methylase that 
25 exclusively methy-lates cytosine residues that are 5' neighbors of guanine (CpG). This 
modification of cytosine residues has important regulatory effects on gene expression, 
especially when involving CpG rich areas, known as CpG islands, located in the promoter 
regions of many genes. 
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Methylation has been shown by several lines of evidence to play a role in gene 
activity, cell differentiation, tumorigenesis, X-chromosome inactivation, genomic 
imprinting and other major biological processes (Razin, A., H., and Riggs, R.D. eds. in 
DNA Methylation Biochemis-try and Biological Significance . Springer- Verlag, New 
5 York, 1984). In eukaryotic cells, methylation of cytosine residues that are immediately 5' 
to a guanosine, occurs predominantly in CG poor regions (Bird, A., Nature, 321 :209. 
1 986). In contrast, CpG islands remain unmethylated in normal cells, except during X- 
chromosome inactivation (Migeon, et al., supra) and parental specific imprinting (Li, et 
al., Nature, 366 :362. 1993) where methylation of 5' regulatory regions can lead to 

10 transcriptional repression. De novo methylation of the Rb gene has been demonstrated in 
a small fraction of retinoblastomas (Sakai, et al., Am. J. Hum. Genet., 48:880, 1991), and 
recently, a more detailed analysis of the VHL gene showed aberrant methylation in a 
subset of sporadic renal cell carcinomas (Herman, et al., Proc. Natl. Acad. Sci., U.S.A., 
91:9700, 1994). Expression of a tumor suppressor gene can also be abolished by de novo 

15 DNA methylation of a normally unmethylated CpG island (Issa, et al., Nature Genet., 
7:536, 1994; Herman, et al., supra; Merlo, et al., Nature Med., 1:686, 1995; Herman, et 
al., Cancer Res., 56:722, 1996; Graff, et al., Cancer Res., 55_:5195, 1995; Herman, et al., 
Cancer Res., 55:4525, 1995). 

20 Human cancer cells typically contain somatically altered nucleic acid, character- 

ized by mutation, amplification, or deletion of critical genes. In addition, the nucleic acid 
from human cancer cells often displays somatic changes in DNA methylation (E.R. 
Fearon, et al., Cell, 61:759, 1990; P.A. Jones, et al., Cancer Res., 46:461, 1986; R. 
Holliday, Science, 238:163, 1987; A. De Bustros, et al., Proc. Natl. Acad. Sci., USA, 

25 85:5693, 1988); P.A. Jones, et al., Adv. Cancer Res., 54: 1, 1990; S.B. Baylin, et al., 

Cancer Cellsa:383, 1991; M. Makos, et al., Proc. Natl. Acad. Sci., USA, 82:1929, 1992; 
N. Ohtani-Fujita, et al., Onco-gene, 8:1063, 1993). However, the precise role of 
abnormal DNA methylation in human tumorigenesis has not been established. 
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Aberrant methylation of normally unmethylated CpG islands has been described as a 
frequent event in immortalized and transformed cells, and has been associated with 
transcriptional inactivation of defined tumor suppressor genes in human cancers. In the 
development of colorectal cancers (CRC), a series of tumor suppressor genes (TSG) such 
5 as APC, p53, DCC and DPC4 are inactivated by mutations and chromosomal deletions 
(reviewed in Kinzler and Vogelstein 1996). Some of these alterations result from a 
chromosomal instability phenotype described in a subset of CRC (Lengauer et al., 
1997a). Recently, an additional pathway has been shown to be involved in a familial form 
of CRC, hereditary non-polyposis colorectal cancer. The cancers from these patients 

10 show a characteristic mutator phenotype which causes microsatellite instability (MI), and 
mutations at other gene loci such as TGF-beta-RII (Markowitz et al., 1995) and BAX 
(Rampino et al., 1997). This phenotype usually results from mutations in the mismatch 
repair (MMR) genes hMSH2 and hMLHl (reviewed by Peltomaki, and de la Chapelle, 
1997). A subset of sporadic CRC also show MI, but mutations in MMR genes appear to 

15 be less frequent in these tumors (Liu et al., 1995; Moslein et al., 1996). 

Another molecular defect described in CRC is CpG island (CGI) methylation. 
CGIs are short sequences rich in the CpG dinucleotide and can be found in the 5' region 
of about half of all human genes. Methylation of cytosine within 5' CGIs is associated 

20 with loss of gene expression and has been seen in physiological conditions such as X 

chromosome inactivation and genomic imprinting (reviewed in Latham, 1996). Aberrant 
methylation of CGIs has been detected in genetic diseases such as the fragile-X 
syndrome, in aging cells and in neoplasia. About half of the tumor suppressor genes 
which have been shown to be mutated in the germline of patients with familial cancer 

25 syndromes have also been shown to be aberrantly methylated in some proportion of 
sporadic cancers, including Rb, VHL, pi 6, hMLHl, and BRCA1 (reviewed in Baylin, 
S.B., Herman, J.G., Graff, J.R., Vertino, P.M. and Issa, J.P. Alterations in DNA 
methylation: a fundamental aspect of neoplasia, Adv. Cancer Res. 72:141-196 1998). 
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TSG methylation in cancer is usually associated with (1) lack of gene transcription and 
(2) absence of coding region mutation. Thus it has been proposed that CGI methylation 
serves as an alternative mechanism of gene inactivation in cancer. 

5 The causes and global patterns of CGI methylation in human cancers remain 

poorly defined. Aging could play a factor in this process because methylation of several 
CGIs could be detected in an age-related manner in normal colon mucosa as well as in 
CRC (Issa, J.P., Vertino. P.M., Boehm, CD., Newsham, I.F. and Baylin, S.B. 
Methylation of the oestrogen receptor CpG island links ageing and neoplasia in human 

10 colon. Nat Genet. 7:536-540, 1994). In addition, aberrant methylation of CGIs has been 
associated with the MI phenotype in CRC as well as specific carcinogen exposures (Issa 
et al., 1996) supra. However, an understanding of aberrant methylation in CRC has been 
somewhat limited by the small number of CGIs analyzed to date. In fact, previous studies 
have suggested that large numbers of CGIs are methylated in immortalized cell lines 

15 (Antequera, F., Boyes, J. and Bird, A. High levels of de novo methylation and altered 

chromatin structure at CpG islands in cell lines. Cell 6^2:503-514, 1990), and it is not well 
understood whether this global aberrant methylation is caused by the cell culture 
conditions or whether they are an integral part of the pathogenesis of cancer. 

20 Most of the methods developed to date for detection of methylated cytosine 

depend upon cleavage of the phosphodiester bond alongside cytosine residues, using 
either methylation-sensitive restriction enzymes or reactive chemicals such as hydrazine 
which differentiate between cytosine and its 5-methyl derivative. Genomic sequencing 
protocols which identify a 5-MeC residue in genomic DNA as a site that is not cleaved by 

25 any of the Maxam Gilbert sequencing reactions have also been used, but still suffer 
disadvantages such as the requirement for large amount of genomic DNA and the 
difficulty in detecting a gap in a sequencing ladder which may contain bands of varying 
intensity. 
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Mapping of methylated regions in DNA has relied primarily on Southern 
hybridization approaches, based on the inability of methylation-sensitive restriction 
enzymes to cleave sequences which contain one or more methylated CpG sites. This 
5 method provides an assessment of the overall methylation status of CpG islands, 
including some quantitative analysis, but is relatively insensitive and requires large 
amounts of high molecular weight DNA. 

Another method utilizes bisulfite treatment of DNA to convert all unmethylated 
10 cytosines to uracil. The altered DNA is amplified and sequenced to show the methylation 
status of all CpG sites. However, this method is technically difficult, labor intensive and 
without cloning amplified products, it is less sensitive than Southern analysis, requiring 
approximately 10% of the alleles to be methylated for detection. 

15 Identification of the earliest genetic changes in tumorigenesis is a major focus in 

molecular cancer research. Diagnostic approaches based on identification of these 
changes are likely to allow implementation of early detection strategies and novel 
therapeutic approaches targeting these early changes might lead to more effective cancer 
treatment. 

20 

SUMMARY OF THE INVENTION 

The present invention is based on the finding that several genes are newly 
identified as being differentially methylated in cancer. This seminal discovery is useful 
25 for cancer screening, risk-assessment, prognosis, minimal-residual disease identification, 
staging and identification of therapeutic targets. The identification of new genes that are 
methylated in cancer, aging or diseases associated with aging increases the likelihood of 
finding genes methylated in a particular cancer; increases the sensitivity and specificity of 



methylation detection; allows methylation profiling using multiple genes; and allows 
identification of new targets for therapeutic intervention. The invention also provides a 
newly identified gene that is a target for hypermethylation in human tumors. This new 
gene, as well as genes newly identified as hypermethylated in cancer and aging or aging 
diseases provides markers which can be used diagnostically, prognostically and 
therapeutically over the course of such disorders. 

In a first embodiment, the invention provides a nucleic acid molecule comprising 
a coding region for a T-type calcium channel, CACNA1G, and regulatory sequences 
associated therewith. The discovery of CpG islands, and in particular, methylated CpG 
islands in the region approximately 300-800 base pairs upstream from the CACNA1G 
translation initiation start site, led to a method of the present invention for correlating 
methylated CpG islands with various cancers. In one aspect of this embodiment, the 
nucleic acid molecule encoding CACNA1G and the associated regulatory sequences and 
CpG-rich regions include the nucleic acid sequence set forth in SEQ ID NO:5 1 (Figure 
3 A). Also provided is a polypeptide having an amino acid sequence as set forth in SEQ 
ID NO:52 and Figure 3B. The methylation state of CpG islands in CACNA1G, 
associated regulatory regions, and other genes is indicative of the presence of a cellular 
proliferative disorder in a subject from which the CpG-containing nucleic acid is isolated. 

In another embodiment, there are provided methods for detecting a cellular 
proliferative disorder in a subject. The subject may have or be at risk of having a cellular 
proliferative disorder. The method of the invention is useful for diagnostic as well as 
prognostic analyses. One method for detecting a cellular proliferative disorder in a 
subject includes contacting a nucleic acid-containing specimen from the subject with an 
agent that provides a determination of the methylation state of at least one gene or 
associated regulatory region of the gene; and identifying aberrant methylation of regions 
of the gene or regulatory region, wherein aberrant methylation is identified as being 
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different when compared to the same regions of the gene or associated regulatory region 
in a subject not having the cellular proliferative, thereby detecting a cellular proliferative 
disorder in the subject. The method includes multiplexing by utilizing a combination of 
primers for more than one loci, thereby providing a methylation "profile" for more than 
5 one gene or regulatory region. 

For the first time, the invention provides methylated forms of the following genes 
and/or their associated regulatory sequences: APOB, CACNA1G, CDX2, EGFR, FBN1, 
GPR37, HSPA6, IQGAP2, KL, PAR2, PITX2, PTCH, SDC1, and SDC4 (see Table 5). 
10 In addition, the invention provides the CpG-rich regions from these genes that are 
hypermethylated (see Figures 4A-4F (SEQ ID NO: 105-1 19). 

Invention methods include determining, in a nucleic acid-containing specimen 
taken from a subject, the methylation state of a gene or regulatory sequences associated 

1 5 therewith, wherein the expression or non-expression of the gene is associated with the 
presence of the cellular proliferative disorder, and identifying as having a cellular 
proliferative disorder a subject that has aberrant methylation of regions of the gene or 
associated regulatory sequences when compared to the same regions of the gene in a 
subject not having the cellular proliferative disorder. In one aspect of this embodiment, 

20 the methylated regions of the gene and associated regulatory sequences are contained 
within CpG islands (i.e., CpG rich regions). In particular, the aberrant methylation 
typically includes hypermethylation as compared with the same regions of the gene or 
regulatory sequences in a subject not having the cellular proliferative disorder. 

25 Determining the methylation state of the gene includes contacting the nucleic 

acid-containing specimen with an agent that modifies unmethylated cytosine, amplifying 
a CpG-containing nucleic acid in the specimen by means of CpG-specific oligonucleotide 
primers, wherein the oligonucleotide primers distinguish between modified methylated 
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and nonmethylated nucleic acid, and detecting the methylated nucleic acid based on the 
presence or absence of amplification products produced in said amplifying step. The 
method includes optionally contacting the amplification products with a methylation 
sensitive restriction endonuclease. Other methods for determining methylation status of a 
5 gene and/or regulatory sequences are well known in the art and are described more fully 
herein. 

In another embodiment, the present invention provides a method of treating a cell 
proliferative disorder associated with CACNA1G or other methylated genes described 

10 herein, comprising administering to a subject with the disorder, a therapeutically effective 
amount of reagent which modulates CACNA1G or other methylated genes' expression. 
For example, since CACNA1G -associated disorders typically involve hypermethylation 
of CACNA1G polynucleotide sequence, a polynucleotide sequence which contains a non- 
methylatable nucleotide analog is utilized for treatment of a subject. Further, the 

1 5 invention provides a method of gene therapy comprising introducing into cells of a host 
subject, an expression vector comprising a nucleotide sequence encoding CACNA1G or 
other methylated genes described herein, in operable linkage with a promoter. 

In another embodiment of the present invention there is provided a kit useful for 
20 the detection of a cellular proliferative disorder in a subject having or at risk for having a 
cellular proliferative disorder. Invention kits include a carrier means compartmentalized 
to receive a sample, one or more containers comprising a first container containing a 
reagent which modifies unmethylated cytosine and a second container containing primers 
for amplification of a CpG-containing nucleic acid, wherein the primers distinguish 
25 between modified methylated and nonmethylated nucleic acid, and optionally, a third 
container containing a methylation sensitive restriction endonuclease. 
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BRIEF DESCRIPTION OF THE FIGURES 

Figure 1A shows the relative positions of MINT31 and CACNA1G and associated 
CpG regions. 

5 Figure 1 B provides a magnified depiction of MINT3 1 , C ACNA 1 G and CpG rich 

regions 1-8. 

Figure 2 depicts the correlation of methylation and the expression of CACNA1G. 

Figures 3 A and 3B show the nucleic acid sequence and deduced amino acid 
sequence of CACNA1G (SEQ ID NO:51 and 52, respectively). 
10 Figures 4A-4F show the CpG-rich regions of the genes depicted in Table 5 (SEQ 

ID NO: 105- 119). 

Figure 5 is the nucleotide sequence of MINT31. (SEQ ID NO: 120). 

DETAILED DESCRIPTION OF THE INVENTION 

15 

It has been determined that an aberrant methylation state of nucleic acids in 
certain genes, particularly regulatory sequences, is diagnostic for the presence or potential 
development of a cellular proliferative disorder in subjects bearing the aberrantly 
methylated nucleic acids. More particularly, the hypermethylation of certain nucleotides 

20 localized in CpG islands has been shown to affect the expression of genes associated with 
the CpG islands; typically such hypermethylated genes have reduced or abolished 
expression, primarily due to down-regulated transcription. Using a recently developed 
PCR-based technique called methylated CpG island amplification (MCA), several nucleic 
acid molecules aberrantly methylated in a colon cancer cell line were identified. One 

25 DNA fragment, termed MINT3 1 , mapped to human chromosome 1 7q21 where frequent 
loss of heterozygosity (LOH) has been detected in various human tumors. By 
characterizing the genomic sequence around this area, a gene encoding a T-type calcium 
channel, CACNA1G, was identified as a target for hypermethylation in human tumors. 
Using RT-PCR, expression of CACNA1G was detected in normal colon and bone 
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marrow, but expression was absent in 5 tumor cell lines where methylation was found. 
After treatment with the methylation inhibitor 5-deoxy-azacytidine, the expression of 
CACNA1G was restored in all 5 cell lines. Detailed methylation mapping of the 5'CpG 
island by bisulfite-PCR revealed that methylation of a region 300 to 800 base pairs 
5 upstream of the translation initiation site closely correlated with the inactivation of 
CACNA1G. Aberrant methylation of CACNA1G was also examined in various human 
primary tumors, and was detected in 17 of 49 (35%) colorectal cancers, 4 of 16 (25%) 
gastric cancers, and 3 of 23 (13%) acute myelogenous leukemia cases. While not 
wanting to be bound by a particular theory, it is believed that inactivation of CACNA1G 
10 may play a role in cancer development by modulating calcium signaling, which 
potentially affects cell proliferation and apoptosis. 

Thus, in one embodiment of the present invention, there are provided nucleic 
acids comprising the coding region for a T-type calcium channel and regulatory 
15 sequences associated therewith. Specifically, the T-type calcium channel and associated 
regulatory sequences comprise CACNA1G. In a more preferred embodiment, the 
CACNA1G is the human form of the gene. An exemplary CACNA1G gene and 
associated regulatory sequences is set forth in SEQ ID NO:51 . 

20 The invention provides methylated and unmethylated nucleic acid encoding 

CACNA1 G (SEQ ID NO:5 1). Polynucleotides include DNA, cDNA and RNA 
sequences which encode CACNA1G polypeptide (SEQ ID NO:52). It is understood that 
naturally occuring, synthetic, and intentionally manipulated polynucleotides are included. 
For example, CACNA1G nucleic acid may be subject to site-directed mutagenesis, or the 

25 like. The nucleic acid sequences for CACNA1G also include antisense sequences, and 
sequences encoding dominant negative forms of CACNA1G, as well as sequences 
encoding functional fragments thereof. It is understood that naturally occurring, synthetic, 
and intentionally manipulated polynucleotides are included. 
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Methylated nucleic acid sequences are also provided. For the first time, the 
present invention provides methylated chemical structures for the following genes: 
APOB, CACNA1G, CDX2, EGFR, FBN1, GPR37, HSPA6, IQGAP2, KL, PAR2, 
5 PITX2, PTCH, SDC 1 , and SDC4. One of skill in the art can now readily locate the 
CpG-rich sequences associated with these genes and identify such methylated forms of 
the genes/regulatory sequences by methods described herein (The gene sequences can be 
identified in a gene database found at http://www.ncbi.nim.nih.gov/UniGene/index.html). 
The invention provides CpG-rich regions from the above genes as set forth in SEQ ID 

1 0 Nos: 1 05- 1 1 9. Thus, in yet another embodiment, the invention provides an isolated 

nucleic acid molecule having at least one methylated Cytosine of a CpG dinucleotide in a 
CpG-rich region and encoding a gene selected from APOB, CACNA1G, CDX2, EGFR, 
FBN1, GPR37, HSPA6, IQGAP2, KL, PAR2, PITX2, PTCH, SDC1, and SDC4. The 
methylated C residue of a CpG dinucleotide is located within a CpG-rich region selected 

15 from SEQ ID NO:105-1 18 and SEQ ID NO:l 19. 



The polynucleotides of the invention include "degenerate variants" which are 
sequences that encode the same polypeptide yet vary in sequence as a result of the genetic 
code. There are 20 natural amino acids, most of which are specified by more than one 
20 codon. Therefore, all degenerate nucleotide sequences are included in the invention as 
long as the amino acid sequence of a polypeptide encoded by the nucleotide sequence of 
SEQ ID NO:51 is functionally unchanged. 

The term "polynucleotide"or "nucleic acid sequence" refers to a polymeric form 
25 of nucleotides at least 1 0 bases in length. An "isolated polynucleotide" is a 

polynucleotide that is not immediately contiguous with both of the coding sequences with 
which it is immediately contiguous (one on the 5' end and one on the 3' end) in the 
naturally occurring genome of the organism from which it is derived. Thus, an isolated 
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polynucleotide may include a coding region with its associated regulatory sequences. 
The term therefore includes, for example, a recombinant DNA which is incorporated into 
a vector; into an autonomously replicating plasmid or virus; or into the genomic DNA of 
a prokaryote or eukaryote, or which exists as a separate molecule (e.g., a cDNA) 
5 independent of other sequences. The nucleotides of the invention can be ribonucleotides, 
deoxyribonucleotides, or modified forms of either nucleotide. Specifically, methylated 
forms of nucleotides in a polynucleotide sequence, such as regions 1-8 of CACNA1G as 
described herein, are also included. The term includes single and double forms of DNA. 

10 As will be understood by those of skill in the art, when the sequence is RNA, the 

deoxynucleotides A, G, C, and T of SEQ ID NO:51, are replaced by ribonucleotides A, 
G, C, and U, respectively. Also included in the invention are fragments of the above- 
described nucleic acid sequences that are at least 15 bases in length, which is sufficient to 
permit the fragment to selectively hybridize to DNA that encodes SEQ ID NO:52. The 

15 term "selectively hybridize" refers to hybridization under moderately or highly stringent 
conditions (See, Maniatis, as cited herein) which excludes non-related nucleotide 
sequences. 

The CACNA1G nucleic acid sequence includes the disclosed sequence and 
20 sequences that encode conservative variations of the polypeptides encoded by 

CACNA1G polynucleotide provided herein. The term "conservative variation" as used 
herein denotes the replacement of an amino acid residue by another, biologically similar 
residue. Examples of conservative variations include the substitution of one hydrophobic 
residue such as isoleucine, valine, leucine or methionine for another, or the substitution of 
25 one polar residue for another, such as the substitution of arginine for lysine, glutamic for 
aspartic acid, or glutamine for asparagine, and the like. The term "conservative 
variation" also includes the use of a substituted amino acid in place of an unsubstituted 
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parent amino acid provided that antibodies raised to the substituted polypeptide also 
immunoreact with the unsubstituted polypeptide. 

CACNA1G nucleic acid sequences can be expressed in vitro by DNA transfer into 
5 a suitable host cell. "Host cells" are cells in which a vector can be propagated and its 
DNA expressed. The cell may be prokaryotic or eukaryotic. The term also includes any 
progeny of the subject host cell. It is understood that all progeny may not be identical to 
the parental cell since there may be mutations that occur during replication. However, 
such progeny are included when the term "host cells" is used. Methods of stable transfer, 
10 meaning that the foreign DNA is continuous-ly maintained in the host, are known in the 
art. 

In one aspect, the CACNA1G nucleic acid sequences may be inserted into an 
expression vector. The term "expression vector" refers to a plasmid, virus or other 

15 vehicle known in the art that has been manipulated by insertion or incorporation of the 
sequence of interest genetic sequences. Polynucleotide sequence which encode sequence 
of interest can be operatively linked to expression control sequences. "Operatively 
linked" refers to a juxtaposition wherein the components so described are in a 
relationship permitting them to function in their intended manner. An expression control 

20 sequence operatively linked to a coding sequence is ligated such that expression of the 
coding sequence is achieved under conditions compatible with the regulatory or 
expression control sequences. As used herein, the terms "regulatory sequences" and 
"expression control sequences" refers to nucleic acid sequences that regulate the 
expression of a nucleic acid sequence to which it is operatively linked. Expression 

25 control sequences are operatively linked to a nucleic acid sequence when the expression 
control sequences control and regulate the transcription and, as appropriate, translation of 
the nucleic acid sequence. Thus expression control sequences can include appropriate 
promoters, enhancers, transcription terminators, a start codon (i.e., ATG) in front of a 
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protein-encoding gene, splicing signal for introns, maintenance of the correct reading 
frame of that gene to permit proper translation of mRNA, and stop codons. The terms 
"regulatory sequences" and "expression control sequences" are intended to included, at a 
minimum, components whose presence can influence expression, and can also include 
additional components whose presence is advantageous, for example, leader sequences 
and fusion partner sequences. An example of an expression control sequence includes a 
promoter. 

A "promoter" is a minimal sequence sufficient to direct transcription. Also 
included in the invention are those promoter elements which are sufficient to render 
promoter-dependent gene expression controllable for cell-type specific, tissue-specific, or 
inducible by external signals or agents; such elements may be located in the 5' or 3' 
regions of the gene. Both constitutive and inducible promoters, are included in the 
invention (see, e.g., Bitter et al., Methods in Enzymology 153:516-544, 1987). For 
example, when cloning in bacterial systems, inducible promoters such as pL of 
bacteriophage plac, ptrp, ptac (ptrp-lac hybrid promoter) and the like may be used. 
When cloning in mammalian cell systems, promoters derived from the genome of 
mammalian cells (e.g., metallothionein promoter) or from mammalian viruses (e.g., the 
retrovirus long terminal repeat; the adenovirus late promoter; the vaccinia virus 7.5K 
promoter) may be used. Promoters produced by recombinant DNA or synthetic 
techniques may also be used to provide for transcription of the nucleic acid sequences of 
the invention. 

In the present invention, the CACNA1G polynucleotide sequence may be inserted 
into an expression vector which contains a promoter sequence which facilitates the 
efficient transcription of the inserted genetic sequence of the host. The expression vector 
typically contains an origin of replication, a promoter, as well as specific genes which 
allow phenotypic selection of the transformed cells. Vectors suitable for use in the 
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present invention include, but are not limited to the T7-based expression vector for 
expression in bacteria (Rosenberg et aL, Gene 56:125, 1987), the pMSXND expression 
vector for expression in mammalian cells (Lee and Nathans, J. Biol. Chem. 263:3521, 
1988) and baculovirus-derived vectors for expression in insect cells. The DNA segment 
5 can be present in the vector operably linked to regulatory elements, for example, a 
promoter (e.g., T7, metallothionein I, or polyhedron promoters). 

CACNA1G polynucleotide sequences can be expressed in either prokaryotes or 
eukaryotes. Hosts can include microbial, yeast, insect and mammalian organisms. 
10 Methods of expressing DNA sequences having eukaryotic or viral sequences in 

prokaryotes are well known in the art. Biologically functional viral and plasmid DNA 
vectors capable of expression and replication in a host are known in the art. Such vectors 
are used to incorp-orate DNA sequences of the invention. 

15 "Transformation" means a genetic change induced in a cell following 

incorporation of new DNA (i.e., DNA exogenous to the cell). Where the cell is a 
mammalian cell, the genetic change is generally achieved by introduction of the DNA 
into the genome of the cell (i.e., stable). 

20 Thus, a "transformed cell" is a cell into which (or into an ancestor of which) has 

been introduced, by means of recombinant DNA techniques, a DNA molecule encoding 
sequence of interest. Transformation of a host cell with recombinant DNA may be carried 
out by conventional techniques as are well known to those skilled in the art. Where the 
host is prokaryotic, such as E. coli, competent cells which are capable of DNA uptake can 

25 be prepared from cells harvested after exponential growth phase and subsequently treated 
by the CaC12 method using procedures well known in the art. Alternatively, MgC12 or 
RbCl can be used. Transforma-tion can also be performed after forming a protoplast of 
the host cell if desired. 
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When the host is a eukaryote, such methods of transfection of DNA as calcium 
phosphate co-precipitates, conventional mechan-ical procedures such as microinjection, 
electroporation, insertion of a plasmid encased in liposomes, or virus vectors may be 
5 used. Eukaryotic cells can also be cotransformed with DNA sequenc-es encoding the 
sequence of interest, and a second foreign DNA molecule encoding a selectable 
phenotype, such as the herpes simplex thymidine kinase gene. Another method is to use 
a eukaryotic viral vector, such as simian virus 40 (SV40) or bovine papilloma virus, to 
transiently infect or transform eukary otic cells and express the protein (see for example, 
10 Eukaryotic Viral Vectors . Cold Spring Harbor Laboratory, Gluzman ed., 1982). 

Methods which are well known to those skilled in the art can be used to construct 
expression vectors containing the CACNA1G coding sequence and appropriate 
transcriptional/translational control signals. These methods include in vitro recombinant 
15 DNA techniques, synthetic techniques, and in vivo recombination/genetic techniques. 

See, for example, the techniques described in Maniatis, et al., 1989 Molecular Cloning A 
Laboratory Manual, Cold Spring Harbor Laboratory, N.Y. 

A variety of host-expression vector systems may be utilized to express the 
20 CACNA1G coding sequence. These include but are not limited to microorganisms such 
as bacteria transformed with recombinant bacteriophage DNA, plasmid DNA or cosmid 
DNA expression vectors containing the CACNA1G coding sequence; yeast transformed 
with recombinant yeast expression vectors containing the CACNA1G coding sequence; 
plant cell systems infected with recombinant virus expression vectors (e.g., cauliflower 
25 mosaic virus, CaMV ; tobacco mosaic virus, TMV) or transformed with recombinant 
plasmid expression vectors (e.g., Ti plasmid) containing the CACNA1G coding 
sequence; insect cell systems infected with recombinant virus expression vectors (e.g., 
baculovirus) containing the CACNA1G coding sequence; or animal cell systems infected 
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with recombinant virus expression vectors (e.g., retroviruses, adenovirus, vaccinia virus) 
containing the CACNA1G coding sequence, or transformed animal cell systems 
engineered for stable expression. Since CACNA1G has not been confirmed to contain 
carbohydrates, both bacterial expression systems as well as those that provide for 
5 translational and post-translational modifications may be used; e.g., mammalian, insect, 
yeast or plant expression systems. 

Depending on the host/vector system utilized, any of a number of suitable 
transcription and translation elements, including constitutive and inducible promoters, 

10 transcription enhancer elements, transcription terminators, etc. may be used in the 

expression vector (see e.g., Bitter, et al., Methods in Enzymology 153:516-544, 1987). 
For example, when cloning in bacterial systems, inducible promoters such as pL of 
bacteriophage .gamma., plac, ptrp, ptac (ptrp-lac hybrid promoter) and the like may be 
used. When cloning in mammalian cell systems, promoters derived from the genome of 

1 5 mammalian cells (e.g., metallothionein promoter) or from mammalian viruses(e.g., the 
retrovirus long terminal repeat; the adenovirus late promoter; the vaccinia virus 7.5K 
promoter) may be used. Promoters produced by recombinant DNA or synthetic 
techniques may also be used to provide for transcription of the inserted CACNA1G 
coding sequence. In addition, the endogenous CACNA1G promoter may also be used to 

20 provide transcription machinery of CACNA1G . 

In bacterial systems a number of expression vectors may be advantageously 
selected depending upon the use intended for the expressed. For example, when large 
quantities of CACNA1G are to be produced, vectors which direct the expression of high 
25 levels of fusion protein products that are readily purified may be desirable. Those which 
are engineered to contain a cleavage site to aid in recovering are preferred. Such vectors 
include but are not limited to the E. coli expression vector pUR278 (Ruther, et al., EMBO 
J. 2:1791, 1983), in which the CACNA1G coding sequence may be ligated into the 
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vector in frame with the lac Z coding region so that a hybrid -lac Z protein is produced; 
pIN vectors (Inouye & Inouye, Nucleic Acids Res., 13:3101-3109, 1985; Van Heeke & 
Schuster, J. Biol. Chem. 264:5503-5509, 1989); glutathione-S-transferase (GST) and the 
like. 

5 

In yeast, a number of vectors containing constitutive or inducible promoters may 
be used. For a review see, Current Protocols in Molecular Biology, Vol. 2, 1988, Ed. 
Ausubel, et al., Green Publish. Assoc. & Wiley Interscience, Ch. 13; Grant, et al., 1987, 
Expression and Secretion Vectors for Yeast, in Methods in Enzymology, Eds. Wu & 

10 Grossman, 31987, Acad. Press, N.Y., Vol. 153, pp. 516-544; Glover, 1986, DNA 

Cloning, Vol. II, IRL Press, Wash., D.C., Ch. 3; and Bitter, 1987, Heterologous Gene 
Expression in Yeast, Methods in Enzymology, Eds. Berger & Rimmel, Acad. Press, 
N.Y., Vol. 152, pp. 673-684; and The Molecular Biology of the Yeast Saccharomyces, 
1982, Eds. Strathern, et al., Cold Spring Harbor Press, Vols. I and II. A constitutive yeast 

15 promoter such as ADH or LEU2 or an inducible promoter such as GAL may be used 
(Cloning in Yeast, Ch. 3, R. Rothstein In: DNA Cloning Vol.1 1, A Practical Approach, 
Ed. DM Glover, 1986, IRL Press, Wash., D.C.). Alternatively, vectors may be used 
which promote integration of foreign DNA sequences into the yeast chromosome. 

20 Incases where plant expression of the CACNA1G coding sequence may be 

driven by any of a number of promoters. For example, viral promoters such as the 35S 
RNA and 19S RNA promoters of CaMV (Brisson, et al., Nature 310:51 1-514, 1984), or 
the coat protein promoter to TMV (Takamatsu, et al., EMBO J.6:307-311, 1987) may be 
used; alternatively, plant promoters such as the small subunit of RUBISCO (Coruzzi, et 

25 al., EMBO 13:1671-1680, 1984; Broglie, et al., Science 224:838-843, 1984); or heat 
shock promoters, e.g., soybean hspl7.5-E or hspl7.3-B (Gurley, et al., Mol. Cell. Biol. 
6:559-565, 1986) may be used. These constructs can be introduced into plant cells using 
Ti plasmids, Ri plasmids, plant virus vectors, direct DNA transformation, microinjection, 
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electroporation, etc. For reviews of such techniques see, for example, Weissbach & 
Weissbach, 1988, Methods for Plant Molecular Biology, Academic Press, N.Y., Section 
VIII, pp. 421-463; and Grierson & Corey, 1988, Plant Molecular Biology, 2d Ed., 
Blackie, London, Ch. 7-9. 

5 

An alternative expression system which could be used to express is an insect 
system. In one such system, Autographa californica nuclear polyhedrosis virus (AcNPV) 
is used as a vector to express foreign genes. The virus grows in Spodoptera frugiperda 
cells. The CACNA1G coding sequence may be cloned into non-essential regions (for 

10 example the polyhedrin gene) of the virus and placed under control of an AcNPV 

promoter (for example the polyhedrin promoter). Successful insertion of the CACNA1G 
coding sequence will result in inactivation of the polyhedrin gene and production of non- 
occluded recombinant virus (i.e., virus lacking the proteinaceous coat coded for by the 
polyhedrin gene). These recombinant viruses are then used to infect Spodoptera 

15 frugiperda cells in which the inserted gene is expressed, (e.g., see Smith, et al., 1983, J. 
Viol. 46:584; Smith, U.S. Pat. No. 4,215,051). 

Eukaryotic systems, and preferably mammalian expression systems, allow for 
proper post-translational modifications of expressed mammalian proteins to occur. 
20 Eukaryotic cells which possess the cellular machinery for proper processing of the 

primary transcript, glycosylation, phosphorylation, and advantageously, secretion of the 
gene product may be used as host cells for the expression of CACNA1G . Mammalian 
cell lines may be preferable. Such host cell lines may include but are not limited to CHO, 
VERO, BHK, HeLa, COS, MDCK, -293, and W138. 

25 

Mammalian cell systems which utilize recombinant viruses or viral elements to 
direct expression may be engineered. For example, when using adenovirus expression 
vectors, the CACNA1G coding sequence may be ligated to an adenovirus 
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transcription/translation control complex, e.g., the late promoter and tripartite leader 
sequence. This chimeric gene may then be inserted in the adenovirus genome by in vitro 
or in vivo recombination. Insertion in a non-essential region of the viral genome (e.g., 
region El or E3) will result in a recombinant virus that is viable and capable of 
5 expressing the protein in infected hosts (e.g., see Logan & Shenk, Proc. Natl. Acad. Sci. 
USA, 81 :3655-3659, 1984). Alternatively, the vaccinia virus 7.5K promoter may be used 
(e.g., see, Mackett, et al., 1982, Proc. Natl. Acad. Sci. USA 79:7415-7419; Mackett, et 
al., J. Virol. 49:857-864, 1984; Panicali, et al., Proc. Natl. Acad. Sci. USA 79:4927-4931, 
1982). Of particular interest are vectors based on bovine papilloma virus which have the 

10 ability to replicate as extrachromosomal elements (Sarver, et al., Mol. Cell. Biol. 1:486, 
1981). Shortly after entry of this DNA into mouse cells, the plasmid replicates to about 
100 to 200 copies per cell. Transcription of the inserted cDNA does not require 
integration of the plasmid into the host's chromosome, thereby yielding a high level of 
expression. These vectors can be used for stable expression by including a selectable 

15 marker in the plasmid, such as, for example, the neo gene. Alternatively, the retroviral 
genome can be modified for use as a vector capable of introducing and directing the 
expression of the CACNA1G gene in host cells (Cone & Mulligan, Proc. Natl. Acad. 
Sci. USA 81:6349-6353, 1984). High level expression may also be achieved using 
inducible promoters, including, but not limited to, the metallothionine IIA promoter and 

20 heat shock promoters. 

For long-term, high-yield production of recombinant proteins, stable expression is 
preferred. Rather than using expression vectors which contain viral origins of replication, 
host cells can be transformed with the CACNA1G cDNA controlled by appropriate 
25 expression control elements (e.g., promoter, enhancer, sequences, transcription 

terminators, polyadenylation sites, etc.), and a selectable marker. The selectable marker in 
the recombinant plasmid confers resistance to the selection and allows cells to stably 
integrate the plasmid into their chromosomes and grow to form foci which in turn can be 
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cloned and expanded into cell lines. For example, following the introduction of foreign 
DNA, engineered cells may be allowed to grow for 1-2 days in an enriched media, and 
then are switched to a selective media. A number of selection systems may be used, 
including but not limited to the herpes simplex virus thymidine kinase (Wigler, et al., 
5 Cell, 1 1 :223, 1 977), hypoxanthine-guanine phosphoribosyltransferase (Szybalska & 
Szybalski, Proc. Natl. Acad. Sci. USA, 48:2026, 1962), and adenine 
phosphoribosyltransferase (Lowy, et al., Cell, 22: 817, 1980) genese can be employed in 
tk.sup.-, hgprt.sup.- or aprt.sup.- cells respectively. Also, antimetabolite resistance can be 
used as the basis of selection for dhfr, which confers resistance to methotrexate (Wigler, 

10 et al., Natl. Acad. Sci. USA, 77:3567, 1980; O'Hare, et al., Proc. Natl. Acad. Sci. USA, 
78: 1527, 1981); gpt, which confers resistance to mycophenolic acid (Mulligan & Berg, 
Proc. Natl. Acad. Sci. USA, 78: 2072, 1981; neo which confers resistance to the 
aminoglycoside G-418 (Colberre-Garapin, et al., J. Mol. Biol., 150:1, 1981); and hygro, 
which confers resistance to hygromycin (Santerre, et al., Gene, 30:147, 1984) genes. 

15 Recently, additional selectable genes have been described, namely trpB, which allows 

cells to utilize indole in place of tryptophan; hisD, which allows cells to utilize histinol in 
place of histidine (Hartman & Mulligan, Proc. Natl. Acad. Sci. USA, 85:8047, 1988); and 
ODC (ornithine decarboxylase) which confers resistance to the ornithine decarboxylase 
inhibitor, 2-(difluoromethyl)-DL-ornithine, DFMO (McConlogue L., 1987, In: Current 

20 Communications in Molecular Biology, Cold Spring Harbor Laboratory, ed.). 

Isolation and purification of microbial expressed polypeptide, or fragments 
thereof, provided by the invention, may be carried out by conventional means including 
preparative chromatography and immunological separations involving monoclonal or 
25 polyclonal antibodies. 



In one embodiment, the invention provides substantially purified polypeptide 
encoded by CACNA1G polynucleotide sequences. Exemplary CACNA1G polypeptide 
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is set forth in SEQ ID NO:52. The term "substantially purified" as used herein refers to a 
polypeptide which is substantially free of other proteins, lipids, carbohydrates or other 
materials with which it is naturally associated. One skilled in the art can purify a 
polypeptide encoded by CACNA1G polynucleotide sequence using standard techniques 
5 for protein purification. The substantially pure polypeptide will yield a single major band 
on a non-reducing polyacrylamide gel. The purity of the CACNA1G polypeptide can also 
be determined by ammo-terminal amino acid sequence analysis. 

Minor modifications of the CACNA1G primary amino acid sequences may result 
10 in proteins which have substantially equivalent activity as compared to the unmodified 
counterpart polypeptide described herein. Such modifications may be deliberate, as by 
site-directed mutagenesis, or may be spontaneous. All of the polypeptides produced by 
these modifications are included herein as long as the biological activity still ex-ists. 

15 The polypeptides of the invention also include dominant negative forms of the 

CACNA1G polypeptide which do not have the biological activity of CACNA1G 
polypeptide sequence. A "dominant negative form" of CACNA1G is a polypeptide that 
is structurally similar to CACNA1G polypeptide but does not have wild-type CACNA1G 
function. For example, a dominant-negative CACNA1G polypeptide may interfere with 

20 wild-type CACNA1 G function by binding to, or otherwise sequestering, regulating 

agents, such as upstream or downstream components, that normally interact functionally 
with the CACNA1G polypeptide. 

Identification and Isolation of CACNA1G 

25 

To identify genes differentially methylated in colorectal cancer, methylated CpG 
island amplification was used followed by representational difference analysis (Razin and 
Cedar, Cell 12: 473-476, 1994, herein incorporated by reference). One of the clones 
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recovered (MINT31, see U.S. Patent Application Serial No. 09/309,175, incorporated by 
reference herein in its entirety) mapped to human chromosome 17q21 using a radiation 
hybrid panel. A Blast search revealed this fragment to be completely identical to part of a 
BAC clone (Genbank: AC004590) sequenced by high throughput genomic sequencing. 

5 The region surrounding MINT3 1 fulfills the criteria of a CpG island: GC content 0.67, 
CpG/GpC ratio 0.78 and a total of 305 CpG sites in a 4 kb region. Using this CpG island 
and 10 kb of flanking sequences in a Blast analysis, several regions highly homologous to 
the rat T-type calcium channel gene, CACNA1G, were identified (Perez-Reyes et al., 
Nature 391: 896-900. 1998, herein incorporated by reference). Several ESTs were also 

10 identified in this region. Using Genscan, 2 putative coding sequences (Gl, and G2) were 
identified. Blastp analysis revealed that Gl has a high homology to the EH-domain- 
binding protein, epsin, while G2 is homologous to a C. elegans hypothetical protein 
(accession No. 2496828). 

15 The MINT3 1 CpG island corresponds to the 3' regions of Gl and G2, based on 

the direction of the open reading frame and the presence of a poly A tail, and, without 
being bound by theory, is unlikely to influence their transcription. The EST closest to 
MINT31 (HI 3333) (SEQ ID NO: 120; Figure 5) was sequenced entirely and was found 
not to contain a continuous open reading frame, but a poly-adenylation signal was 

20 identified on one end, along with a poly A tail. These data suggest that H13333 

corresponds to the last 2 exons of an unidentified gene. MINT3 1 is in the intron of this 
gene and is, again without being bound by theory, unlikely to influence transcription of 
the unidentified gene. However, based on both promoter prediction (TSSG) analysis of 
this region and homology to the rat CACNA1G sequence, the MTNT31 CpG island is also 

25 in the 5' region of human CACNA1 G gene and is likely to play a role in its 
transcriptional activity. 
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The human CACNA1G sequence deposited in Genbank lacks the 5' region of the 
gene, when compared to the rat homologue. To determine the 5' region of human 
CACNA1G, cDNA was amplified by RT-PCR using primers based on the BAC sequence 
(Genbank: AC004590, herein incorporated by reference). The PCR products were cloned 

5 and sequenced, and the genomic organization of the gene was determined by comparing 
the newly identified sequences as well as the known sequences to the BAC that covers 
this region. CACNA1G is composed of 34 exons which span a 70 kb area. (See, Figure 
3 A and SEQ ID NO: 52). Based on sequences deposited in Genbank, the gene has two 
possible 3' ends caused by alternate splicing. Human CACNA1G is highly homologous to 

10 rat CACNA1 G with 93% identity at the protein level, and 89% identity at the nucleotide 
level. The 5' flanking region of human CACNA1G lacks TATA and CAAT boxes, which 
is similar to many housekeeping genes. A putative TFIID binding site was identified 547- 
556 bp upstream from the translation start site, and several other potential transcription 
factor binding sites such as API (1 site), AP2 (2 sites) and SP1 (10 sites), were identified 

1 5 upstream of CACNA1 G exon 1 using the promoter prediction program, TESS. 



Methvlation Analysis of CACNA1G 



The CACNA1G CpG island is 4 kb, and is larger than many typical CpG islands. 
20 MTNT3 1 corresponds to the 5' edge of the island while CACNA1G is in the 3' region. It 
is not known whether large CpG islands such as this are coordinately regulated with 
regards to protection from methylation, and aberrant methylation in cancer. 

To address this issue, the methylation status of the 5' region of CACNA1G was 
25 studied using bisulfite-PCR of DNA from normal tissues as well as 35 human cancer cell 
lines from colon, lung, prostate, breast and hematopoietic tumors. More specifically, 
forty-nine primary colorectal cancers, 28 colorectal adenomas, 16 primary gastric cancers 
and 17 acute myelogenous leukemia samples were used for methylation analyses. DNA 
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from eight colon cancer cell lines (Caco2, RKO, SW48, HCT116, DLD1, Lovo, SW837, 
HT29), 4 lung cancer cell lines (OH3, H249, HI 57, H209), 4 glioblastoma cell fines 
(Dauy, D283, U87, U373), 8 breast cancer cell lines (MB-468, MCF7, MB-231, MB-474, 
MB-435, MB-453, BT20, CAMA1, SKBR3), 7 hematopoietic tumor cell lines (CEM, 

5 Raji, KG1 A, HL60, ML-l,MoIt3, K562), and 4 prostate cancer cell lines (DU145, 

DUPRO, LNCAP, TSUPRL) were also investigated. The CpG island was divided into 8 
regions (SEQ ID NOs: 35-42, respectively). The methylation status of each region was 
examined separately. The genomic DNA was treated with sodium bisulfite and PCR 
amplified using primers containing no or a minimum number of CpG sites. (For a 

10 detailed description of bisulfite-PCR, see, U.S. Patent No. 5,786, 146, incorporated herein 
by reference in its entirety). Methylated alleles were detected by digesting the PCR 
products using restriction enzymes which specifically cleave sites created or retained due 
to the presence of methylated CpGs. None of the regions was methylated in normal 
colon, consistent with a uniform protection against de-novo methylation. 

15 

Regions 1 and 2 were frequently methylated in cancer cell lines, and behaved in a 
concordant manner with respect to methylation pattern. Indeed, these 2 regions were 
methylated in most cancer cell types except gliomas. Moreover, most cell lines where 
methylation was found methylated both regions 1 and 2. in contrast, region 3, which is 

20 less CG rich than any of the other regions, had either no methylation or very low levels of 
methylation in most cell lines. Regions 5, 6, and 7 behaved quite differently compared to 
1-3. Methylation of these regions was less frequent than regions 1-2, as 22/35 cell lines 
had no detectable methylation there, despite often showing methylation of region 1-2. 
However, when methylation of regions 5, 6, or 7 was present (in 13/35 cell lines), it 

25 affected all 3 regions in a coordinate manner, although to varying extents. Finally, regions 
4 and 8 behaved differently again, being partially methylated primarily in colon and 
breast cell lines. Therefore, with regards to hypermethylation in cancer cells, the CpG 
rich region upstream of CACNA1G appears to be composed of 2 CpG islands which 
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behave independently. MINT31 corresponds to the upstream CpG island (island 1, 
regions 1 and 2), while the 5' region of CACNA1G is contained in the downstream CpG 
island (island 2, regions 5-7). Regions 3, 4 and 8 correspond to the edge of these CpG 
islands, and behave a little differently than the hearts of the CpG islands, as previously 
5 described for the E-Cad gene (Graff, et al., J. Biol. Chem. 272: 22322-22329, 1997). 

Overall, the methylation patterns of CACNA1G fell into 5 distinct categories: (1) 
No methylation in any region (normal tissue). (2) Slight methylation of island 1 (6 cell 
lines, e.g., TSU-PRL). (3) Heavy methylation of island 1 but no methylation of island 2 
10 (16 cell lines, e.g., Caco2). (4) Heavy methylation of island 1 and moderate to heavy 

methylation of island 2 (6 cell lines, e.g., RKO and Raji). (5) High methylation of island 
1 and low to moderate methylation of island 2 (7 cell lines, e.g., MB-23 1). 

Methylation Depe ndent Expression of CACNA1G 

15 

In a previous study, rat CACNA1G was shown to be expressed most abundantly 
in the brain (Perez-Reyes et al., Nature 391: 896-900. 1998). To determine the expression 
of CACNA1G in normal and neoplastic human cells, RT-PCR was performed using 
cDNA from various normal tissues and from a panel of 27 tumor cell lines. CACNA1G 

20 was expressed ubiquitously in a variety of tissues and cell lines. In normal tissues 

expression was relatively low but easily detectable, while most cell lines had relatively 
high expression of CACNA1G. However, some cell lines had negligible or totally absent 
levels of CACNA1G expression. The results of CACNA1G expression was correlated 
with the detailed methylation analysis previously described. In this analysis, a remarkable 

25 pattern emerged. Methylation of regions 1-4 and 8 had no effect on CACNA1G 

expression. However, there was a strong correlation between methylation of regions 5-7 
and expression of the gene. In fact, all cell lines tested that lack methylation of this region 
strongly express the gene. All 6 cell lines with pattern 4 methylation studied had no 
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detectable expression. Finally, the 7 cell lines with pattern 5 methylation (examples 
DLD-1 and MB-453) had variable levels of expression ranging from very low to near 
normal. The fact that patterns 3 and 5 differ significantly with regards to expression, but 
are almost identical with regards to methylation of all regions except 7 indicates that this 
5 area is important in the inactivation of C ACNA 1 G. 

To confirm that methylation of the 5' CpG island of CACNA1G is associated with 
gene inactivation, 3 non-expressing cell lines showing pattern 4 methylation (RKO, 
SW48 and Raji) and 2 weakly expressing cell lines showing pattern 5 methylation (MB- 
10 23 1 and MB-435) were treated with 1 M of the methyl-transferase inhibitor 5-deoxy- 
azacitidine. After treatment, all of these cell lines re-expressed CACNA1G mRNA. 
Consistent with re-expression, demethylation of region 7 was observed after 5-deoxy- 
azacitidine treatment. 

15 De novo cytosine methylation is thought to sometimes occur in vitro during cell 

propagation (Antequera et al., Cell 62: 503-514, 1990). To determine whether the 
methylation of CACNA1G occurs in vivo, primary human tumors were examined for 
methylation of the 5' region of CACNA1G. Aberrant methylation was detected in 17 out 
of 49 (35%) colorectal cancers, 4 out of 28 colorectal adenomas (25%), 4 out of 16 (25%) 

20 gastric cancers and 3 out of 17 (18%) acute myelogenous leukemia cases. In colorectal 
cancers, there was a significant correlation between methylation of CACNA1G and 
methylation of pi 6 (p<0.005) and hMLHl (pO.OOl), as well as a strong correlation with 
the presence of microsatellite instability, and the recently identified CpG island 
methylator phenotype (CIMP), indicating that CACNA1G is also a target for CIMP in 

25 colorectal cancer. 

To determine whether aberrant methylation of the 5' region of CACNA1G affects 
the expression status of this gene in primary tumors, RT-PCR was performed using 



cDNA from a series of colorectal adenomas. Six out of 8 cases which showed no 
methylation of region 7 expressed CACNA1G. In sharp contrast, all 5 cases that showed 
methylation of region 7 had no detectable expression of the gene. 



5 Thus, a human T-type calcium channel gene (CACNAIG) has been identified and 

cloned using the MINT3 1 sequence as a probe. The human T-type calcium channel gene 
has been determined to be a target of aberrant methylation and silencing in human 
tumors. The data show that MINT31 (for a representative sequence of MINTl-33, see, 
US Patent Application Serial No. 09/309,175) hereby incorporated by reference can be 

10 used as a probe to identify genes that play a role in disorders such as cell proliferative 
disorders. 

Detailed analysis of the CpG island upstream of CACNAIG revealed that 
methylation 300 to 800 bp upstream of the gene closely correlated with transcriptional 

1 5 inactivation. The CACNA1 G promoter is contained in a large CG rich area that is not 
coordinately methylated in cancer. The CpG island around MINT31 is much more 
frequently methylated in cancers compared to that just upstream of CACNAIG. This may 
simply be caused by differential susceptibility to de-novo methylation between these two 
regions, with methylation of MINT3 1 serving as a trigger, and eventually spreading to 

20 CACNAIG, as described in other genes (Graff, et al., J. Biol. Chem. 272: 22322-22329, 
1997). However, it is likely that these 2 regions are controlled by different mechanisms 
because (1) cell lines kept in culture for countless generations do not in fact spread 
methylation from MINT31 to CACNAIG (e.g., Caco2), (2) region 3 that separates the 2 
islands is infrequently and sparsely methylated in cancer and (3) 2 cases of primary 

25 colorectal cancer were found which are methylated at the CACNA1 G promoter but not at 
MINT31). Therefore, methylation of MTNT31 appears to be independent of methylation 
of CACNAIG suggesting that they are 2 distinct CpG islands regulated by different 
mechanisms. 
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Many CpG islands of silenced genes appear to be methylated uniformly and 
heavily throughout the island (e.g., Graff, et al., J. Biol. Chem. 272: 22322-22329, 1997). 
In contrast the methylation patterns of the 5' region of CACNA1G (region 5-7) was 
5 heterogeneous in the cell lines which did not express this gene. Nevertheless, 

methylation clearly plays a role in CACNA1G repression since demethylation readily 
reactivates the gene. 



The mechanism of CACNA1G methylation remains to be determined. 

10 Methylation was not detected in normal colon mucosa, placenta, normal breast epithelium 
and normal bone marrow, including samples from aged patients, suggesting that 
methylation of this region is specific for cell proliferative disorders such as cancer, and 
the like. However, there was a significant correlation between methylation of 
CACNA1G and other tumor suppressor genes such as pl6 and hMLHl. Thus, 

15 CACNA1G is likely a target for the recently described CIMP phenotype, which results in 
a form of epigenetic instability with simultaneous inactivation of multiple genes. 



T-type calcium channels are involved not only in electrophysiological rhythm 
generation but also in the control of cytosolic calcium during cell proliferation and cell 

20 death (reviewed in Berridge, et al., Nature 395: 645-648, 1998). Expression of 

CACNA1G is not limited to brain and heart, indicating a likely role in other tissues in 
which it is expressed. It has previously been shown that Ca2+ influx via T-type channels 
is an important factor during the initial stages of cell death such as apoptosis (Berridge, et 
al., Nature 3^5: 645-648, 1998), ischemia (Fern, J. Neurosci. 18: 7232-7243, 1998) and 

25 complement-induced cytotoxicity (Newsholme, et al., Biochem. J. 295: 773-779, 1993.). 
The studies culminating in the present invention indicates that impairment of voltage 
gated calcium channels plays an important role in cancer development and progression 
through altering calcium signaling. 
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Due to the clear correlation between methylation of CpG islands and cellular 
proliferative disorders, in another embodiment of the present invention, there are 
provided methods for detecting a cellular proliferative disorder in a subject having or at 
5 risk for said cellular proliferative disorder. The method includes assaying, in nucleic 
acid-containing specimen taken from said subject, the methylation state of a gene or its 
associated regulatory regions, wherein the expression state of the gene or its associated 
regulatory regions is associated with the presence of the cellular proliferative disorder, 
and identifying as having a cellular proliferative disorder a subject that has aberrant 
10 methylation of regions of said gene. The method provides for detecting a cellular 

proliferative disorder in a subject having or at risk for said cellular proliferative disorder 
by identifying aberrantly methylation of regions of a gene when compared to the same 
regions of the gene in a subject not having said cellular proliferative disorder. 

15 The aberrant methylation comprises hypermethylated CpG rich regions (i.e., 

islands). In one aspect of the present invention, the CpG rich regions are associated with 
the CACNA1G gene, and affect the expression thereof in a methylation state-dependent 
manner. A "cell proliferative disorder" or "cellular proliferative disorder" is any disorder 
in which the proliferative capabilities of the affected cells is different from the normal 

20 proliferative capabilities of unaffected cells. An example of a cell proliferative disorder 
is neoplasia. Malignant cells (i.e., cancer) develop as a result of a multistep process. 
Specific, non-limiting examples of cell proliferative disorders associated with increased 
methylation of CpG-islands are low grade astrocytoma, anaplastic astrocytoma, 
glioblastoma, medulloblastoma, gastric cancer, colorectal cancer, colorectal adenoma, 

25 acute myelogenous leukemia, lung cancer, renal cancer, leukemia, breast cancer, prostate 
cancer, endometrial cancer and neuroblastoma. 

A cell proliferative disorder as described herein may be a neoplasm. Such 
neoplasms are either benign or malignant. The term "neoplasm" refers to a new, 
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abnormal growth of cells or a growth of abnormal cells that reproduce faster than normal. 
A neoplasm creates an unstructured mass (a tumor) which can be either benign or 
malignant. For example, the neoplasm may be a head, neck, lung, esophageal, stomach, 
prostate, small bowel, colon, bladder, kidney, or cervical neoplasm. The term "benign" 
5 refers to a tumor that is noncancerous, e.g. its cells do not proliferate or invade 

surrounding tissues. The term "malignant" refers to a tumor that is metastastic or no 
longer under normal cellular growth control. 



A cell proliferative disorder may be an age-associated disorder. Examples of age- 
10 associated disorders which are cell proliferative disorders include colon cancer, lung 
cancer, breast cancer, prostate cancer, leukemia and melanoma, amongst others. 

A "nucleic acid containing specimen" includes any type of material containing a 
nucleic acid to be subject to invention methods. The nucleic acid may be contained in a 
15 biological sample. Such samples include but are not limited to any bodily fluid, such as a 
serum, urine, saliva, blood, cerebrospinal fluid, pleural fluid, ascites fluid, sputum, stool, 
or a biopsy sample. 



Samples or specimens include any CpG-rich DNA sequence, whatever the origin, 
20 as long as the sequence is detectably present in a sample. While routine diagnostic tests 
may not be able to identify cancer cells in these samples, the method of the present 
invention identifies neoplastic cells derived from the primary tumor or from a metastases. 
The method includes non-invasive sampling (e.g., bodily fluid) as well as invasive 
sampling (e.g., biopsy). The sample of DNA of the subject may be serum, plasma, 
25 lymphocytes, urine, sputum, bile, stool, cervical tissue, saliva, tears, cerebral spinal fluid, 
regional lymph node, histopathologic margins, and any bodily fluid that drains a body 
cavity or organ. Therefore, the method provides for the non-invasive detection of various 
tumor types including head and neck cancer, lung cancer, esophageal cancer, stomach 
cancer, small bowel cancer, colon cancer, bladder cancer, kidney cancers, cervical cancer 
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and any other organ type that has a draining fluid accessible to analysis. For example, 
neoplasia of regional lymph nodes associated with a primary mammary tumor can be 
detected using the method of the invention. Regional lymph nodes for head and neck 
carcinomas include cervical lymph nodes, prelaryngeal lymph nodes, pulmonary juxta- 
5 esophageal lymph nodes and submandibular lymph nodes. Regional lymph nodes for 
mammary tissue carcinomas include the axillary and intercostal nodes. Samples also 
include urine DNA for bladder cancer or plasma or saliva DNA for head and neck cancer 
patients. 

10 Any nucleic acid sample, in purified or nonpurified form, can be utilized as the 

starting nucleic acid or acids in accordance with the present invention, provided it 
contains, or is suspected of containing, a nucleic acid sequence containing a target locus 
(e.g., CpG-containing nucleic acid). In general, the CpG-containing nucleic acid is DNA. 
However, invention methods may employ, for example, samples that contain DNA, or 

1 5 DNA and RNA, including messenger RNA, wherein DNA or RNA may be single 

stranded or double stranded, or a DNA-RNA hybrid may be included in the sample. A 
mixture of nucleic acids may also be employed. The specific nucleic acid sequence to be 
detected may be a fraction of a larger molecule or can be present initially as a discrete 
molecule, so that the specific sequence constitutes the entire nucleic acid. It is not 

20 necessary that the sequence to be studied be present initially in a pure form; the nucleic 
acid may be a minor fraction of a complex mixture, such as contained in whole human 
DNA. The nucleic acid-containing sample used for detection of methylated CpG may be 
from any source including, but not limited to, brain, colon, urogenital, lung, renal, 
pancreas, liver, esophagus, stomach, hematopoietic, breast, thymus, testis, ovarian, 

25 prostate and uterine tissue, and may be extracted by a variety of techniques such as that 
described by Maniatis, et al. Molecular C loning: a Laboratory Manual. Cold Spring 
Harbor, NY, pp 280, 281, 1982). 
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The nucleic acid of interest can be any nucleic acid where it is desirable to detect 
the presence of a differentially methylated CpG island. The CpG island comprises a CpG 
island located in a gene or regulatory region for a gene. A "CpG island" is a CpG rich 
region of a nucleic acid sequence. The nucleic acid sequence may include, for example, 
5 APOB, CACNA1G, CDX2, EGFR, FBN1, GPR37, HSPA6, IQGAP2, KL, PAR2, 
PITX2, PTCH, SDC1, or SDC4 (see for example Figures 4A-4F). Alternatively the 
nucleic acid of interest can be, for example, a MINT31 nucleic acid sequence (SEQ ID 
NO: 120. However, any gene or nucleic acid sequence of interest containing a CpG 
sequence can provide diagnostic information (i.e., via its mmethylation state) using 
10 invention methods. 

Moreover, these markers can also be multiplexed in a single amplification 
reaction to generate a low cost, reliable cancer screening test for many cancers 
simultaneously. A combination of DNA markers for CpG-rich regions of nucleic acid 

15 may be amplified in a single amplification reaction. The markers are multiplexed in a 
single amplification reaction, for example, by combining primers for more than one 
locus. For example, DNA from a urine sample can be amplified with three different 
randomly labeled primer sets, such as those used for the amplification of the CACNA1G, 
EGFR and PTCH loci, in the same amplification reaction. The reaction products are 

20 separated on a denaturing polyacrylamide gel, for example, and then exposed to film for 
visualization and analysis. By analyzing a panel of markers, there is a greater probability 
of producing a more useful methylation profile for a subject. 

If the sample is impure (e.g., plasma, serum, stool, ejaculate, sputum, saliva, 
25 cerebrospinal fluid, or blood or a sample embedded in paraffin), it may be treated before 
amplification with a reagent effective for lysing the cells contained in the fluids, tissues, 
or animal cell membranes of the sample, and for exposing the nucleic acid(s) contained 
therein. Methods for purifying or partially purifying nucleic acid from a sample are well 
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known in the art (e.g.,Sambrook et al., Molecular C loning: a Laboratory Manual. Cold 
Spring Harbor Press, 1989, herein incorporated by reference). 

In order to detect a differential methylation state for a gene or CpG-containing 
5 region of interest, invention methods include any means known in the art for detecting 
such differential methylation. For example, detecting the differential methylation may 
include contacting the nucleic acid-containing specimen with an agent that modifies 
unmethylated cytosine, amplifying a CpG-containing nucleic acid in the specimen by 
means of CpG-specific oligonucleotide primers, wherein the oligonucleotide primers 
10 distinguish between modified methylated and nonmethylated nucleic acid, and detecting 
the methylated nucleic acid based on the presence or absence of amplification products 
produced in said amplifying step. This embodiment includes the PCR-based methods 
described in U.S. Patent No. 5,786,146, incorporated herein in its entirety. 

15 For the first time, the methylation state of a number of genes has been correlated 

with cell proliferative disorders. Examples of such genes, primers useful for identifying 
their methylation state, and general PCR conditions are set out in Table 1 . 
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TABLE 1 Bisulfite-PCR Primers 







5' rimer 


3 '-primer Annealing Temperature (oC) 




APOB 


v'^mfTt ott+oaatttttt V 
J -gLllllgI^gUXggglllUl-.j 


5 -RccaacaacRccaacaac-3 


57 






TaqI 










See Table 2 








R6 


5 '-ttgtggygttggygatagtt-3 ' 


- 

5'-acraaaaaaaaaaaaaaaaatctctt-3 
















R7 


5 -gggggygttttttttyggatttt-3 


5'-ttcccctacrcccctaaaacttcc-3 


















j -gtaggttagagggagggarygtt-^ 


5'-aaaacaaacctcaccatactacct-3 








Taql 








EGFR 


5 ' -tttgatttYgtttagtattgat-3 ' 


5 -cccttacctttcttttcct-3 








Hinfl, Taql 










J -UtlLllallg I gtuidllUgO 


5 -tttccccacctcttcaaata-3 


54 














GPR37 


5 -ggttaggtggggtaagagttt-3 


5 ' -aacRtttaatccaattacaaacc-3 ' 


56 






Hinfl 








HSPA6 


5'- ttttttagtagttttgagttagagg-3' 


5' - tttcatccttttacacctccc-3' 














20 


IQ 


5 ' -tttatttttattttagtgttag-3 ' 


5 ' -ctcttcRtataacatcctac-3 ' 
















KL 


5 - gtagtatglLygttagygltL-j 


5'- aaataccctaaaaaaaaccc-3' 


60 






T f q ! ,„,,,,^ 








PAR2 


5 -YggtttttttGaaatttaattc-3 


5 -aactccRcatcctcctaaa-3 


45 


25 




Hinfl, Taql 




60 




PITX2 


5'- taagtgtttygttaggtttttt-3 ' 


5'- ccaaactccactacacaataac-3' 






Taql 








PTCHA 


5'- gttggtttgttaatyggagt-3' 


5'- ttaccaaccraaaccatatt-3' 


60 


30 




Taql 


5'- taaaacaaccaactacaacttac-3 ' 


60 




PTCHB 


5'- aatgtgtggaatttaggga-3' 






Taql 




60 




SDC1 


5'- agaggaattyggtagtagagag-3' 


5'- cacaaccaaaaccaaaac-3' 






Taql 






35 


SDC4 


5'-ggggatttgtttggtagtgg-3 ' 


5 ' -cccgaaattccccaataaa-3 ' 


56 



Hinfl 

Note: Y = C or T, R = G or A; row 1-SEQ ID NO: 1 and 2; row 2-SEQ ID NO:3 and 4 and so forth through 
to SEQ ID NO:32, respectively. The gene sequences can be found in a gene database found at 
40 http://www.ncbi.nim.nih.gov/UniGene/index.html. 

In one embodiment, the oligonucleotide primers are specifically targeted to 
CACNA1G and its associated CpG islands as described herein. Examples of 
oligonucleotides suited for determining the methylation state of the 8 regions of the two 
45 CpG islands of MINT3 1/CACNA1 G, as well as PCR conditions and useful methylation 
sensitive restriction endonucleases are set out in Table 2. 
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TABLE 2 Primers Useful For Bisulfite/PCR Analysis of CACNA1 G 



Region Primer set, forward /reverse 



Region 1 
F165 
Region 2 
GM2 
Region 3 
GM3 
Region 4 
GM5 
Region 5 
GM1 
Region 6 
GM6 
Region 7 
GM4 
Region 8 



Region 8 5' 



GAYGGYGTAGTAGTTATTTTGTT-3' 
CATCACCACCCCTCACTTTAC-3' 
■TTYGGGTATTTATAGTTTTTTGGAG-3 ' 
AATCTACCRCCTTCACTCACTC-3' 
■TTTAGGAGYGTTAATGTGAGGTT-3' 
■CTAAAAAAACCCAATCTTAAAAAAAC-3* 
-TGGATAAAGGATGTTTGGGGTTTG-3' 
■CCCTCCCCTTACCCCTAAATCC-3' 
-AATYGGATTTTAGTTGTGGTTTTT-3 ' 
-CACACCACAACTAAATCCCTCACT-3' 
-TTGTGGYGTTGGYGATAGTT-3' 
'-ACRAAAAAAAAAAAAAAAAATCTCTT-3 ' 
'-GGGGG YGTTTTTTTTYGGATTTT-3 ' 
'-TTCCCCTACRCCCCTAAAACTTCC-3 ' 
'-GGGAGTTTGGGAGTTGTATTTTGTT-3' 
'-AACCAAATTAAAAAATCAAACCCTAA-3* 
■GAGGGGGGATYGTAATTTTTTG-3' 
■CCRAAATCTCCTTATTATACCTCCAA-3 ' 



Annealing 
temperature (cycles) 

58 (3), 56 (4), 

54 (5), 52 (26) 
60 (3), 58 (4), 
56 (5), 54 (26) 

55 (3), 53 (4), 
51(5), 49(26) 

55 (5), 53 (5), 
51(5), 49(26) 
60 (3), 58 (4), 

56 (5), 54 (26) 
53 (3), 51 (4), 
49 (5), 47 (26) 

55 (5), 53 (5), 
51(5), 49(26) 
60 (3), 58 (4), 

56 (5), 54 (26) 
60 (3), 58 (4), 
56 (5), 54 (26) 



Restriction 
enzyme 

BstUI 

Maell 

TaqI 

BstUI 

Hinfl 

Maell 

Maell 

TaqI 

BstUI 

TaqI 

Hinfl 

EcoRI 

TaqI 

BstUI 



25 



Region 1 F/R=SEQ ID NO:33 and 34; Region 2=SEQ ID NO:35 and 36 and so forth through 
Region 8 (TaqI)=SEQ ID NO:47 and 48 and Region 8 (BstUI)=SEQ ID NO:49 and 50. 
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Exemplary target regions (i.e., regions 1-8 of MINT31/CACNA1G) that are 
complementary to the primers listed in Table 2 are provided in Table 3: 



Table 3 - Target Sequences 



Region/ 
Primer 


Target set, forward /reverse 


SEQ ID NO 


Corresponding 
SEQ ID NO 


Region 1 


5'- AACAAAATAACTACTACRCCRTC-3 ' 


87 


33 


F165 


5'- GTAAAGTGAGGGGTGGTGATG-3' 


88 


34 


Region 2 


5'- CTCC AAAAAACTATAAATACCCRAA-3 ' 


89 


35 


GM2 


5'- GAGTGAGTGAAGGYGGTAGATT-3' 


90 


36 


Region 3 


5'- AACCTCACATTAACRCTCCTAAA-3' 


91 


37 


GM3 


5'- GTTTTTTTAAGATTGGGTTTTTTTAG-3' 


92 


38 


Region 4 


5'- CAAACCCCAAACATCCTTTATCCA-3 ' 


93 


39 


GM5 


5'- GGATTTAGGGGTAAGGGGAGGG-3 ' 


94 


40 


Region 5 


5'- AAAAACCACAACTAAAATCCRATT-3' 


95 


41 


GM1 


5'- AGTGAGGGATTTAGTTGTGGTGTG-3' 


96 


42 


Region 6 


5'- AACTATCRCCAACRCCACAA-3 ' 


97 


43 


GM6 


5'- AAGAGATTTTTTTTTTTTTTTTTYGT-3' 


98 


44 


Region 7 


5'- AAAATCCRAAAAAAAACRCCCCC-3 ' 


99 


45 


GM4 


5'- GGAAGTTTTAGGGGYGTAGGGGAA-3' 


100 


46 


Region 8 


5'- AACAAAATACAACTCCCAAACACCC-3' 


101 


47 




5'- TTAGGGTTTGATTTTTTAATTTGGTT-3' 


102 


48 


Region 8 


5'- CAAAAAATTACRATCCCCCCTC-3' 


103 


49 




5'- TTGGAGGTATAATAAGGAGATTTYGG-3' 


104 


50 
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TABLE 4 Targets for Bisulfite-PCR Primers 



Genes 




CtLL\> 1U IN KJ 


COr prtoer dmS 
SEQ ID NO 


ArUB j - large i 


5 '-AAAAAACCCAAACTACAAAAAC-3 ' 


55 


1 


— 3 4*!"f f 1 


5'GTTGTTGGRGTTGTTGGR -3' 


56 


2 


R6 5 -target 


5'- AACTATCYCCAACYCCACAA -3' 


57 


3 


— 3 pTf ^ — ; 


5'- AAGAGATTTTTTTTTTTTTTTTTRGT -3' 


58 


4 


Jv7 5 -tsrgct 


5'- AAAATCCYAAAAAAAACYCCCCC —3' 


59 


5 




5'- GGAAGTTTTAGGGGRGTAGGGGAA -3' 


60 


6 


tL/Az j -Targe i 


5'- AACYATCCCTCCCTCTAACCTAC -3' 


61 


7 




5'- AGGTAGTATGGTGAGGTTTGTTTT -3' 


62 


8 


iivxrJv j -target 


5'- ATCAATACTAAACRAAATCAAA -3' 


63 


9 




5'- AGGAAAAGAAAGGTAAGGG -3' 


64 


10 




5'- CAAAATTAACRCAATAAAAAAA -3' 


65 


11 




5'- TATTTGAAGAGGTGGGGAAA -3' 


66 


12 


GPR37 5 -target 


5'- AAACTCTTACCCCACCTAACC -3' 


67 


13 




5'- GGTTTGTAATTGGATTAAAYGTT -3' 


68 


14 


Jrlor'AO D -larger 


5'- CCACTAACTCAAAACTAAAAAA —3' 


69 


15 




5'- GGGAGGTGTAAAAGGATGAAA — 3' 


70 


16 


li^uArZ j -larger 


5'- CTAACACTAAAATAAAAATAAA —3' 


71 


17 


— 3 '^p 1 


5'- GTAGGATGTTATAYGAAGAG —3' 


72 


18 


KL 5 -target 


5'- AAACRCTAACRAACATACTAC —3' 


73 


19 


— 3 '" tar | e ^ 


c ? r.finTTTTTTTT A r.r.P.T A TTT ' 
D - UUU i I 1 1 1 1 1 1 /\UU^J 1A111 -J 


74 


20 


PAR2 5 -target 


D - uuul 1 1 1 i 1 1 lAuuulftl no 


75 


21 


3 '-target 


xTTAnnArirjATrtvrtrjAriTT .3' 

J - 1 1 1 AUUAUUA 1 VJ I VJVJAWJ 1 1 -j 


76 


22 


PITX2 5 '-target 


5'- AAAAAACCTAACRAAACACTTA -3' 


77 


23 


3 '-target 


5'- GTTATTGTGTAGTGGAGTTTGG -3' 


78 


24 


PTCHA 5 '-target 


5'- ACTCCRATTAACAAACCAAC -3' 


79 


25 


3 '-target 


5'- AATATGGTTTYGGTTGGTAA -3' 


80 


26 


PTCHB 5 '-target 


5'- TCCCTAAATTCCACACATT -3' 


81 


27 


3 '-target 


5' - GTAAGTTGTAGTTGGTTGTTTTA -3' 


82 


28 


SDC1 5 '-target 


5'- CTCTCTACTACCRAATTCCTCT -3' 


83 


29 


3 '-target 


5'- GTTTTGGTTTTGGTTGTG -3' 


84 


30 


SDC4 5 '-target 


5'- CCACTACCAAACAAATCCCC -3' 


85 


31 


3 '-target 


5'- TTTATTGGGGAATTTCGGG -3' 


86 


32 
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Table 5 New genes differentially methylated in disease versus normal issue 



Gene 
Symbol 


Gene name 


Map 


Unigene 
Entry 1 


Methylated In 2 


APOB 


Apolipoprotein B 


2p24 


Hs.585 


Common Tumors 


CACNAIG 


T-type calcium channel 


17 






CDX2 


Caudal type homeo box 
transcription factor 2 


13ql2,3 


Hs.77399 


Leukemias, breast, 
prostate 


EGFR 


Epidermal Growth 
Factor Receptor 


7pl2 


Hs.77432 


Leukemias, breast 


FBN1 


Fibrillin- 1 


15q21.1 


Hs.750 


Colon, Breast, prostate, 
leukemias 


GPR37 


G protein-coupled 
receptor 37 


7q31 


Hs.27747 


colon, breast, leukemias 


HSPA6 


Heat shock 70kD protein 
6 (HSP70B') 


iq 


Hs.3268 


Common tumors 


IQGAP2 


RasGAP-related protein 




Hs.78993 


Common tumors 


KL 


Klotho 


- 


Hs.94592 


Common tumors 


PAR2 


Proteinase-activated 
receptor 2 


5ql3 


Hs. 154299 


Leukemias, breast 


PITX2 


Paired- like 
homeodomain 
transcription factor 2 


4q25-27 


Hs.92282 


Leukemias, prostate, 
breast 


PTCH 3 


Patched 


9Q31 


Hs.159526 


Leukemias 


SDC1 


Syndecan 1 


2p24.1 


Hs.82109 


Leukemias 


SDC4 


Syndecan 4 


20ql2 


Hs.72082 


Leukemias 


Table 5: Newge 


nes differentially methylated in cancel 


• and other diseases. 







1 Gene database that can be found at http://www.ncbi.nlm.nih.gov/UniGene/index.html. 

2 Examples: List is not comprehensive. 

3 Two promoters are affected. 
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In another embodiment, detection of differential methylation is accomplished by 
contacting a nucleic acid sample suspected of comprising a CpG-containing nucleic acid 
with a methylation sensitive restriction endonuclease that cleaves only unmethylated CpG 

5 sites under conditions and for a time to allow cleavage of unmethylated nucleic acid. The 
sample is further contacted with an isoschizomer of the methylation sensitive restriction 
endonuclease, that cleaves both methylated and unmethylated CpG-sites, under 
conditions and for a time to allow cleavage of methylated nucleic acid. Oligonucleotides 
are added to the nucleic acid sample under conditions and for a time to allow ligation of 

10 the oligonucleotides to nucleic acid cleaved by the restriction endonuclease, and the 

digested nucleic acid is amplified by conventional methods such as PCR wherein primers 
complementary to the oligonucleotides are employed. Following identification, the 
methylated CpG-containing nucleic acid can be cloned, using method well known to one 
of skill in the art (see Sambrook et al., Molecular C loning: a Laboratory Manual. Cold 

15 Spring Harbor Press, 1989). 

As used herein, a "methylation sensitive restriction endonuclease" is a restriction 
endonuclease that includes CG as part of its recognition site and has altered activity when 
the C is methylated as compared to when the C is not methylated. Preferably, the 

20 methylation sensitive restriction endonuclease has inhibited activity when the C is 
methylated (e.g., Smal). Specific non-limiting examples of a methylation sensitive 
restriction endonucleases include Sma I, BssHII, or Hpall. Such enzymes can be used 
alone or in combination. Other methylation sensitive restriction endonucleases will be 
known to those of skill in the art and include, but are not limited to SacII, EagI, and 

25 BstUI, for example. An "isoschizomer" of a methylation sensitive restriction 

endonuclease is a restriction endonuclease which recognizes the same recognition site as 
a methylation sensitive restriction endonuclease but which cleaves both methylated and 
unmethylated CGs. One of skill in the art can readily determine appropriate conditions for 
a restriction endonuclease to cleave a nucleic acid (see Sambrook et al., Molecular 
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Cloning: a Laboratory Manual . Cold Spring Harbor Press, 1989). Without being bound 
by theory, actively transcribed genes generally contain fewer methylated CGs than in 
other genes. 

5 In one embodiment of the invention, a nucleic acid of interest is cleaved with a 

methylation sensitive endonuclease. In one aspect, cleavage with the methylation 
sensitive endonuclease creates a sufficient overhang on the nucleic acid of interest. 
Following cleavage with the isoschizomer, the cleavage product can still have a 
sufficient overhang. An "overhang" refers to nucleic acid having two strands wherein the 

10 strands end in such a manner that a few bases of one strand are not base paired to the 

other strand. A "sufficient overhang" refers to an overhang of sufficient length to allow 
specific hybridization of an oligonucleotide of interest. In one embodiment, a sufficient 
overhang is at least two bases in length. In another embodiment, the sufficient overhang 
is four or more bases in length. An overhang of a specific sequence on the nucleic acid of 

15 interest may be desired in order for an oligonucleotide of interest to hybridize. In this 
case, the isoschizomer can be used to create the overhang having the desired sequence on 
the nucleic acid of interest. 

In another aspect of this embodiment, the cleavage with a methylation sensitive 
20 endonuclease results in a reaction product of the nucleic acid of interest that has a blunt 
end or an insufficient overhang. In this embodiment, an isoschizomer of the methylation 
sensitive restriction endonuclease can create a sufficient overhang on the nucleic acid of 
interest. "Blunt ends" refers to a flush ending of two stands, the sense stand and the 
antisense strand, of a nucleic acid. 



Once a sufficient overhang is created on the nucleic acid of interest, an 
oligonucleotide is ligated to the nucleic acid cleaved of interest which has been cleaved 
by the methylation specific restriction endonuclease. "Ligation" is the attachment of two 
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nucleic acid sequences by base pairing of substantially complementary sequences and/or 
by the formation of covalent bonds between two nucleic acid sequences. In one aspect of 
the present invention, an "oligonucleotide" is a nucleic acid sequence of about 2 up to 
about 40 bases in length. It is presently preferred that the oligonucleotide is from about 
5 15 to 35 bases in length. 

In one embodiment, an adaptor is utilized to create DNA ends of desired sequence 
and overhang. An "adaptor" is a double-stranded nucleic acid sequence with one end that 
has a sufficient single-stranded overhang at one or both ends such that the adaptor can be 

10 ligated by base-pairing to a sufficient overhang on a nucleic acid of interest that has been 
cleaved by a methylation sensitive restriction enzyme or an isoschizomer of a methylation 
sensitive restriction enzyme. Adaptors can be obtained commercially, or two 
oligonucleotides can be utilized to form an adaptor. Thus, in one embodiment, two 
oligonucleotides are used to form an adaptor; these oligonucleotides are substantially 

15 complementary over their entire sequence except for the region(s) at the 5' and/or 3' ends 
that will form a single stranded overhang. The single stranded overhang is 
complementary to an overhang on the nucleic acid cleaved by a methylation sensitive 
restriction enzyme or an isoschizomer of a methylation sensitive restriction enzyme, such 
that the overhang on the nucleic acid of interest will base pair with the 3' or 5' single 

20 stranded end of the adaptor under appropriate conditions. The conditions will vary 

depending on the sequence composition (GC vs AT), the length, and the type of nucleic 
acid (see Sambrook et al., Molecular C loning: a Laboratory Manual. 2nd Ed.; Cold 
Spring Harbor Laboratory Press, Plainview, NY, 1998). 

25 Following the ligation of the oligonucleotide, the nucleic acid of interest is 

amplified using a primer complementary to the oligonucleotide. Specifically, the term 
"primer" as used herein refers to a sequence comprising two or more deoxyribo- 
nucleotides or ribonucleotides, preferably more than three, and more preferably more than 
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eight, wherein the sequence is capable of initiating synthesis of a primer extension 
product, which is substantially complementary to a nucleic acid such as an adaptor or a 
ligated oligonucleotide. Environmental conditions conducive to synthesis include the 
presence of nucleoside triphosphates and an agent for polymerization, such as DNA 

5 polymerase, and a suitable temperature and pH. The primer is preferably single stranded 
for maximum efficiency in amplification, but may be double stranded. If double 
stranded, the primer is first treated to separate its strands before being used to prepare 
extension products. In one embodiment, the primer is an oligodeoxyribo-nucleotide. The 
primer must be sufficiently long to prime the synthesis of extension products in the 

10 presence of the inducing agent for polymerization. The exact length of primer will 

depend on many factors, including temperature, buffer, and nucleotide composition. The 
oligonucleotide primer typically contains 12-20 or more nucleotides, although it may 
contain fewer nucleotides. 

15 Primers of the invention are designed to be "substantially" complementary to each 

strand of the oligonucleotide to be amplified and include the appropriate G or C 
nucleotides as discussed above. This means that the primers must be sufficiently 
complementary to hybridize with their respective strands under conditions which allow 
the agent for polymerization to perform. In other words, the primers should have 

20 sufficient complementarity with a 5' and 3' oligonucleotide to hybridize therewith and 
permit amplification of CpG containing nucleic acid sequence. 

Primers of the invention are employed in the amplification process which is an 
enzymatic chain reaction that produces exponential quantities of target locus relative to 
25 the number of reaction steps involved (e.g., polymerase chain reaction or PCR). 

Typically, one primer is complementary to the negative (-) strand of the locus and the 
other is complementary to the positive (+) strand. Annealing the primers to denatured 
nucleic acid followed by extension with an enzyme, such as the large fragment of DNA 
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Polymerase I (Klenow) and nucleotides, results in newly synthesized + and - strands 
containing the target locus sequence. Because these newly synthesized sequences are 
also templates, repeated cycles of denaturing, primer annealing, and extension results in 
exponential production of the region (i.e., the target locus sequence) defined by the 

5 primer. The product of the chain reaction is a discrete nucleic acid duplex with termini 
corresponding to the ends of the specific primers employed. 

The oligonucleotide primers of the invention may be prepared using any suitable 
method, such as conventional phosphotriester and phosphodiester methods or automated 
embodiments thereof. In one such automated embodiment, diethylphos-phoramidites are 

10 used as starting materials and may be synthesized as described by Beaucage, et al. 
(Tetrahedron Letters, 22: 1 859- 1 862, 1 98 1 ) . One method for synthesizing 
oligonucleotides on a modified solid support is described in U.S. Patent No. 4,458,066. 

Where the CpG-containing nucleic acid sequence of interest contains two 
15 strands, it is necessary to separate the strands of the nucleic acid before it can be used as a 
template for the amplification process. Strand separation can be effected either as a 
separate step or simultaneously with the synthesis of the primer extension products. This 
strand separation can be accomplished using various suitable denaturing conditions, 
including physical, chemical, or enzymatic means, the word "denaturing" includes all 
20 such means. One physical method of separating nucleic acid strands involves heating the 
nucleic acid until it is denatured. Typical heat denaturation may involve temperatures 
ranging from about 80° to 105°C for times ranging from about 1 to 10 minutes. Strand 
separation may also be induced by an enzyme from the class of enzymes known as 
helicases or by the enzyme RecA, which has helicase activity, and in the presence of 
25 riboATP, is known to denature DNA. The reaction conditions suitable for strand 
separation of nucleic acids with helicases are described by Kuhn Hoffmann-Berling 
(CSH-Quantitative Biology, 43:63, 1978) and techniques for using RecA are reviewed in 
C. Radding (Ann. Rev. Genetics, 16:405-437, 1982). 
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When complementary strands of nucleic acid or acids are separated, regardless of 
whether the nucleic acid was originally double or single stranded, the separated strands 
are ready to be used as a template for the synthesis of additional nucleic acid strands. 

5 This synthesis is performed under conditions allowing hybridization of primers to 

templates to occur. Generally synthesis occurs in a buffered aqueous solution, generally 
at a pH of about 7-9. Preferably, a molar excess (for genomic nucleic acid, usually about 
108:1 primer:template) of the two oligonucleotide primers is added to the buffer 
containing the separated template strands. It is understood, however, that the amount of 

10 complementary strand may not be known if the process of the invention is used for 
diagnostic applications, so that the amount of primer relative to the amount of 
complementary strand cannot be determined with certainty. As a practical matter, 
however, the amount of primer added will generally be in molar excess over the amount 
of complementary strand (template) when the sequence to be amplified is contained in a 

15 mixture of complicated long-chain nucleic acid strands, a large molar excess is preferred 
to improve the efficiency of the process. 

The deoxyribonucleoside triphosphates dATP, dCTP, dGTP, and dTTP are added 
to the synthesis mixture, either separately or together with the primers, in adequate 

20 amounts and the resulting solution is heated to about 90°-l 00°C from about 1 to 10 

minutes, preferably from 1 to 4 minutes. After this heating period, the solution is allowed 
to cool to approximately room temperature, which is preferable for the primer 
hybridization. To the cooled mixture is added an appropriate agent for effecting the 
primer extension reaction (called herein "agent for polymerization"), and the reaction is 

25 allowed to occur under conditions known in the art. The agent for polymerization may 
also be added together with the other reagents if it is heat stable. This synthesis (or 
amplification) reaction may occur at room temperature up to a temperature above which 
the agent for polymerization no longer functions. Thus, for example, if DNA polymerase 
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is used as the agent, the temperature is generally no greater than about 40°C. Most 
conveniently the reaction occurs at room temperature. 

The agent for polymerization may be any compound or system which will 
5 function to accomplish the synthesis of primer extension products, including enzymes. 
Suitable enzymes for this purpose include, for example, E. coli DNA polymerase I, 
Klenow fragment of E. coli DNA polymerase I, T4 DNA polymerase, other available 
DNA polymerases, polymerase muteins, reverse transcriptase, and other enzymes, 
including heat-stable enzymes (i.e., those enzymes which perform primer extension after 
10 being subjected to temperatures sufficiently elevated to cause denaturation such as Taq 
DNA polymerase, and the like). Suitable enzymes will facilitate combination of the 
nucleotides in the proper manner to form the primer extension products which are 
complementary to each locus nucleic acid strand. Generally, the synthesis will be 
initiated at the 3' end of each primer and proceed in the 5' direction along the template 
15 strand, until synthesis terminates, producing molecules of different lengths. There may 
be agents for polymerization, however, which initiate synthesis at the 5' end and proceed 
in the other direction, using the same process as described above. 

Preferably, the method of amplifying is by PCR, as described herein and as is 
20 commonly used by those of ordinary skill in the art. However, alternative methods of 
amplification have been described and can also be employed. PCR techniques and many 
variations of PCR are known. Basic PCR techniques are described by Saiki et al. (1988 
Science 239:487-491) and by U.S. Pat. Nos. 4,683,195, 4,683,202 and 4,800,159, which 
are incorporated herein by reference. 

25 

The conditions generally required for PCR include temperature, salt, cation, pH 
and related conditions needed for efficient copying of the master-cut fragment. PCR 
conditions include repeated cycles of heat denaturation (i.e. heating to at least about 
95.degree. C.) and incubation at a temperature permitting primer: adaptor hybridization 
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and copying of the master-cut DNA fragment by the amplification enzyme. Heat stable 
amplification enzymes like the pwo, Thermus aquaticus or Thermococcus litoralis DNA 
polymerases are commercially available which eliminate the need to add enzyme after 
each denaturation cycle. The salt, cation, pH and related factors needed for enzymatic 
amplification activity are available from commercial manufacturers of amplification 
enzymes. 

As provided herein an amplification enzyme is any enzyme which can be used for 
in vitro nucleic acid amplification, e.g. by the above-described procedures. Such 
amplification enzymes include pwo, Escherichia coli DNA polymerase I, Klenow 
fragment of E. coli DNA polymerase I, T4 DNA polymerase, T7 DNA polymerase, 
Thermus aquaticus (Taq) DNA polymerase, Thermococcus litoralis DNA polymerase, 
SP6 RNA polymerase, T7 RNA polymerase, T3 RNA polymerase, T4 polynucleotide 
kinase, Avian Myeloblastosis Virus reverse transcriptase, Moloney Murine Leukemia 
Virus reverse transcriptase, T4 DNA ligase, E. coli DNA ligase or Q.beta. replicase. 
Preferred amplification enzymes are the pwo and Taq polymerases. The pwo enzyme is 
especially preferred because of its fidelity in replicating DNA. 

Once amplified, the nucleic acid can be attached to a solid support, such as a 
membrane, and can be hybridized with any probe of interest, to detect any nucleic acid 
sequence. Several membranes are known to one of skill in the art for the adhesion of 
nucleic acid sequences. Specific non-limiting examples of these membranes include 
nitrocellulose (NITROPURE) or other membranes used in for detection of gene 
expression such as polyvinylchloride, diazotized paper and other commercially available 
membranes such as GENESCREEN, ZETAPROBE (Biorad), and NYTRAN Methods 
for attaching nucleic acids to these membranes are well known to one of skill in the art. 
Alternatively, screening can be done in a liquid phase. 
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In nucleic acid hybridization reactions, the conditions used to achieve a particular 
level of stringency will vary, depending on the nature of the nucleic acids being 
hybridized. For example, the length, degree of complementarity, nucleotide sequence 
composition (e.g., GC v. AT content), and nucleic acid type (e.g., RNA v. DNA) of the 
5 hybridizing regions of the nucleic acids can be considered in selecting hybridization 
conditions. An additional consideration is whether one of the nucleic acids is 
immobilized, for example, on a filter. 

An example of progressively higher stringency conditions is as follows: 2 x 
SSC/0.1% SDS at about room temperature (hybridization conditions); 0.2 x SSC/0.1% 
SDS at about room temperature (low stringency conditions); 0.2 x SSC/0.1% SDS at 
about 42°C (moderate stringency conditions); and 0.1 x SSC at about 68°C (high 
stringency conditions). Washing can be carried out using only one of these conditions, 
e.g., high stringency conditions, or each of the conditions can be used, e.g., for 10-15 
minutes each, in the order listed above, repeating any or all of the steps listed. However, 
as mentioned above, optimal conditions will vary, depending on the particular 
hybridization reaction involved, and can be determined empirically. In general, 
conditions of high stringency are used for the hybridization of the probe of interest. 

20 The probe of interest can be detectably labeled, for example, with a radioisotope, 

a fluorescent compound, a bioluminescent compound, a chemiluminescent compound, a 
metal chelator, or an enzyme. Those of ordinary skill in the art will know of other 
suitable labels for binding to the probe, or will be able to ascertain such, using routine 
experimentation. 

25 

In one embodiment, representational difference analysis (RDA, see Lisitsyn et al., 
Science 259:946-951 , 1993, herein incorporated by reference) can be performed on CpG- 
containing nucleic acid following MCA. MCA utilizes kinetic and subtractive 
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enrichment to purify restriction endonuclease fragments present in one population of 
nucleic acid fragments but not in another. Thus, RDA enables the identification of small 
differences between the sequences of two nucleic acid populations. RDA uses nucleic 
acid from one population as a "tester" and nucleic acid from a second population as a 
"driver" in order to clone probes for single copy sequences present in (or absent from) 
one of the two populations. In one embodiment, nucleic acid from a "normal" individual 
or sample, not having a disorder such as a cell-proliferative disorder is used as a "driver," 
and nucleic acid from an "affected" individual or sample, having the disorder such as a 
cell proliferative disorder is used as a "tester." In one embodiment, the nucleic acid used 
as a "tester" is isolated from an individual having a cell proliferative disorder such as low 
grade astrocytoma, anaplastic astrocytoma, glioblastoma, medulloblastoma, gastric 
cancer, colorectal cancer, colorectal adenoma, acute myelogenous leukemia, leukemia, 
lung cancer, renal cancer, breast cancer, prostate cancer, endometrial cancer and 
neuroblastoma. The nucleic acid used as a "driver" is thus normal astrocytes, normal 
glial cells, normal brain cells, normal gastric cells, normal colorectal cells, normal 
leukocytes, normal lung cells, normal kidney cells, normal breast cells, normal prostate 
cells, normal uterine cells, and normal neurons, respectively. In an additional 
embodiment, the nucleic acid used as a "driver" is isolated from an individual having a 
cell proliferative disorder such as low grade astrocytoma, anaplastic astrocytoma, 
glioblastoma, medulloblastoma, gastric cancer, colorectal cancer, colorectal adenoma, 
acute myelogenous leukemia, leukemia, lung cancer, renal cancer, breast cancer, prostate 
cancer, endometrial cancer and neuroblastoma. The nucleic acid used as a "tester" is thus 
normal astrocytes, normal glial cells, normal brain cells, normal gastric cells, normal 
colorectal cells, normal leukocytes, normal lung cells, normal kidney cells, normal breast 
cells, normal prostate cells, normal uterine cells, and normal neurons, respectively. One 
of skill in the art will readily be able to identify the "tester" nucleic acid useful with to 
identify methylated nucleic acid sequences in given "driver" population. 
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KITS 

The materials for use in the assay of the invention are ideally suited for the 
preparation of a kit. Therefore, in accordance with another embodiment of the present 
invention, there is provided a kit it useful for the detection of a cellular proliferative 
5 disorder in a subject having or at risk for said cellular proliferative disorder. Invention 
kits include a carrier means compartmentalized to receive a sample in close confinement 
therein, one or more containers comprising a first container containing a reagent which 
modifies unmethylated cytosine and a second container containing primers for 
amplification of a CpG-containing nucleic acid, wherein the primers distinguish between 
10 modified methylated and nonmethylated nucleic acid, and optionally, a third container 
O containing a methylation sensitive restriction endonuclease. Primers contemplated for 

y use in accordance with the invention include those set forth in SEQ ID NOs: 1 -50. 

'■■3 

m 

W Carrier means are suited for containing one or more container means such as vials, 

m 

m 15 tubes, and the like, each of the container means comprising one of the separate elements 

q to be used in the method. In view of the description provided herein of invention 

ff methods, those of skill in the art can readily determine the apportionment of the necessary 

ffl reagents among the containiner means. For example, one of the container means can 

comprise a container containing an oligonucleotide for ligation to nucleic acid cleaved by 
20 a methylation sensitive restriction endonuclease. One or more container means can also 
be included comprising a primer complementary to the oligonucleotide. In addition, one 
or more container means can also be included which comprise a methylation sensitive 
restriction endonuclease. One or more container means can also be included containing 
an isoschizomer of said methylation sensitive restriction enzyme. 



In another embodiment, the kit may comprise a carrier means containing one or 
more container means comprising a solid support, wherein the solid support has a nucleic 
acid sequence of CACNA1G as described herein immobilized on the solid support. In 
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one embodiment, the solid support is a membrane. Several membranes are known to one 
of skill in the art for the adhesion of nucleic acid sequences. Specific non-limiting 
examples of these membranes include nitrocellulose (Nitropure) or other membranes used 
in for detection of gene expression such as polyvinylchloride, diazotized paper and other 

5 commercially available membranes such as GENESCREEN, ZETAPROBE (Biorad), and 
NYTRAN The CACNA1G sequences immobilized on the solid support can then be 
hybridized to nucleic acid sequences produced by performing the MCA procedure, 
bisulfite PCR or other methylation detection methods on the nucleic acids of a sample of 
interest in order to determine if the nucleic acid sequences contained in the sample are 

10 methylated. 

The term "oligonucleotide primer" refers to a sequence of two or more deoxyribo- 
nucleotides or ribonucleotides, preferably at least eight, which sequence is capable of 
initiating synthesis of a primer extension product that is substantially complementary to a 
15 target nucleic acid strand. The oligonucleotide primer typically contains fifteen to 

twenty-two or more nucleotides, although it may contain fewer nucleotides if the primer 
is complementary, so as to specifically allow the amplification of the specifically desired 
target nucleotide sequence. 

20 The oligonucleotide primers for use in the invention may be prepared using any 

suitable method, such as conventional phosphotriester and phosphodiester methods or 
automated embodiments thereof. In one such automated embodiment, diethyl-phos- 
phoramidites are used as starting materials and may be synthesized as described by 
Beaucage et al., Tetrahedron Letter, 22: 1859-1862 (1981). One method for synthesizing 

25 oligonucleotides on a modified solid support is described in U.S. Patent No. 4,458,066. 
The exact length of primer will depend on many factors, including temperature, buffer, 
and nucleotide composition. The primer must prime the synthesis of extension products 
in the presence of the inducing agent for amplification. 
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Primers used according to the method of the invention are complementary to each 
strand of mutant nucleotide sequence to be amplified. The term "complementary" means 
that the primers must hybridize with their respective strands under conditions which 
allow the agent for polymerization to function. In other words, the primers that are 
5 complementary to the flanking sequences hybridize with the flanking sequences and 
permit amplification of the nucleotide sequence. Preferably, the 3' terminus of the primer 
that is extended has perfectly base paired complementarity with the complementary 
flanking strand. 

10 The term "flanks CpG-rich regions" refers to those DNA sequences on 

chromosome that are upstream (5') or downstream (3') to the DNA sequence to be 
amplified. The sequence to be amplified is preferably a CpG-rich region in a gene or 
regulatory region associated with a gene. For example, when the nucleotide sequence to 
be amplified is double stranded, a first sequence that is 5' to the nucleotide sequence and 

15 a second sequence that is 5' to the nucleotide sequence on the complementary strand flank 
the CpG-rich DNA sequence. 



The nucleotide sequences that flank nucleotide repeats, i.e., the nucleotide 
sequences to which the oligonucleotide primers hybridize, may be selected from among 
20 the following nucleotide sequences: SEQ ID NO:1-50. 

In general, the primers used according to the method of the invention embrace 
oligonucleotides of sufficient length and appropriate sequence which provide specific 
initiation of polymerization of a significant number of nucleic acid molecules containing 
25 the target nucleic acid under the conditions of stringency for the reaction utilizing the 
primers. In this manner, it is possible to selectively amplify the specific target nucleic 
acid sequence containing the nucleic acid of interest. Oligonucleotide primers used 
according to the invention are employed in any amplification process that produces 
increased quantities of target nucleic acid. 
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The invention includes antibodies immunoreactive with CACNA1G polypeptide 
or functional fragments thereof. Antibody which consists essentially of pooled 
monoclonal antibodies with different epitopic specificities, as well as distinct monoclonal 

5 antibody preparations are provided. Monoclonal antibodies are made from antigen 
containing fragments of the protein by methods well known to those skilled in die art 
(Kohler, et al, Nature, 256:495, 1975). The term antibody as used in this invention is 
meant to include intact molecules as well as fragments thereof, such as Fab and F(ab') 2 , 
Fv and SCA fragments which are capable of binding an epitopic determinant on 

10 CACNA1G . 

(1) An Fab fragment consists of a monovalent antigen-binding fragment of an antibody 
molecule, and can be produced by digestion of a whole antibody molecule with the 
enzyme papain, to yield a fragment consisting of an intact light chain and a portion of a 

15 heavy chain. 

(2) An Fab' fragment of an antibody molecule can be obtained by treating a whole 
antibody molecule with pepsin, followed by reduction, to yield a molecule consisting of 
an intact light chain and a portion of a heavy chain. Two Fab' fragments are obtained per 

20 antibody molecule treated in this manner. 

(3) An (Fab') 2 fragment of an antibody can be obtained by treating a whole antibody 
molecule with the enzyme pepsin, without subsequent reduction. A (Fab') 2 fragment is a 
dimer of two Fab' fragments, held together by two disulfide bonds. 



(4) An Fv fragment is defined as a genetically engineered fragment containing the 
variable region of a light chain and the variable region of a heavy chain expressed as two 
chains. 
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(5) A single chain antibody ("SCA") is a genetically engineered single chain molecule 
containing the variable region of a light chain and the variable region of a heavy chain, 
linked by a suitable, flexible polypeptide linker. 

5 

As used in this invention, the term "epitope" refers to an antigenic determinant on 
an antigen, such as a CACNA1G polypeptide, to which the paratope of an antibody, such 
as an CACNA1G -specific antibody, binds. Antigenic determinants usually consist of 
chemically active surface groupings of molecules, such as amino acids or sugar side 
10 chains, and can have specific three-dimensional structural characteristics, as well as 
specific charge characteristics. 

As is mentioned above, antigens that can be used in producing CACNA1G 
-specific antibodies include CACNA1G polypeptides or CACNA1G polypeptide 

15 fragments. The polypeptide or peptide used to immunize an animal can be obtained by 
standard recombinant, chemical synthetic, or purification methods. As is well known in 
the art, in order to increase immunogenicity, an antigen can be conjugated to a carrier 
protein. Commonly used carriers include keyhole limpet hemocyanin (KLH), 
thyroglobulin, bovine serum albumin (BSA), and tetanus toxoid. The coupled peptide is 

20 then used to immunize the animal (e.g. , a mouse, a rat, or a rabbit). In addition to such 
carriers, well known adjuvants can be administered with the antigen to facilitate induction 
of a strong immune response. 

The invention also provides a method for detecting a cell proliferative disorder 
25 associated with CACNA1G in a subject, comprising contacting a target cellular 

component suspected of having a CACNA1G associated disorder, with a reagent which 
reacts with or binds to CACNA1G and detecting CACNA1G . The target cell component 
can be nucleic acid, such as DNA or RNA, or it can be protein. When the component is 
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nucleic acid, the reagent is typically a nucleic acid probe or PCR primer. When the cell 
component is protein, the reagent is typically an antibody probe. The target cell 
component may be detected directly in situ or it may be isolated from other cell 
components by common methods known to those of skill in the art before contacting with 
5 a probe. (See for example, Maniatis, et al., Molecular Cloning, A Laboratory Manual, 
Cold Spring Harbor Laboratory, N.Y. 1989; Current Protocols in Molecular Biology, 
1994, Ed. Ausubel, et al., Green Publ. Assoc. & Wiley Interscience.) Detection methods 
include Southern and Northern blot analyses, RNase protection, immunoassays and other 
detection assays that are known to those of skill in the art. 

10 

The probes can be detectably labeled, for example, with a radioisotope, a 
fluorescent compound, a bioluminescent compound, a chemiluminescent compound, a 
metal chelator, or an enzyme. Those of ordinary skill in the art will know of other suitable 
labels for binding to the probes or will be able to ascertain such, using routine 
15 experimentation. 

Since the present invention shows that a decreased level of CACNA1G 
transcription is often the result of hypennethylation of the CACNA1G gene, it is often 
desirable to directly determine whether the CACNA1G gene is hypermethylated. In 

20 particular, the cytosine rich areas terms "CpG islands" which lie in the 5' regulatory 

regions of genes are normally unmethylated. The term "hypermethylation" includes any 
methylation of cytosine which is normally unmethylated in the CACNA1G gene 
sequence can be detected by restriction endonuclease treatment of CACNA1G 
polynucleotide (gene) and Southern blot analysis for example. Therefore, in a method of 

25 the invention, when the cellular component detected is DNA, restriction endonuclease 

analysis is preferable to detect hypermethylation of the CACNA1G gene. Any restriction 
endonuclease that includes CG as part of its recognition site and that is inhibited when the 
C is methylated, can be utilized. Methylation sensitive restriction endonucleases such as 
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BssHII, Mspl, NotI or Hpall, used alone or in combination are examples of such 
endonucleases. Other methylation sensitive restriction endonucleases will be known to 
those of skill in the art. In addition, PCR can be utilized to detect the methylation status 
of the CACNA1G gene. Oligonucleotide primers based on any coding sequence region 
5 intheCACNAlG sequence are useful for amplifying DNA by PCR. CACNA1G is 
described here for exemplary purposes. The other genes described herein as being 

For purposes of the invention, an antibody or nucleic acid probe specific for 
CACNA1G may be used to detect the presence of CACNA1G polypeptide (using 

10 antibody) or polynucleotide (using nucleic acid probe) in biological fluids or tissues. 
Oligonucleotide primers based on any coding sequence region in the CACNA1G 
sequence are useful for amplifying DNA, for example by PCR. Any specimen containing 
a detectable amount of CACNA1G polynucleotide or CACNA1G polypeptide antigen 
can be used. Nucleic acid can also be analyzed by RNA in situ methods which are known 

15 to those of skill in the art. A preferred sample of this invention is tissue of heart, renal, 
brain, colon, breast, urogenital, uterine, hematopoietic, prostate, thymus, lung, testis, and 
ovarian. Preferably the subject is human. 

Various disorders which are detectable by the method of the invention include 
20 astrocytoma, anaplastic astrocytoma, glioblastoma, medulloblastoma, colon cancer, lung 
cancer, renal cancer, leukemia, breast cancer, prostate cancer, endometrial cancer and 
neuroblastoma. 

Monoclonal antibodies used in the method of the invention are suited for use, for 
25 example, in immunoassays in which they can be utilized in liquid phase or bound to a 
solid phase carrier. In addition, the monoclonal antibodies in these immunoassays can be 
detectably labeled in various ways. Examples of types of immunoassays which can utilize 
monoclonal antibodies of the invention are competitive and non-competitive 
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immunoassays in either a direct or indirect format. Examples of such immunoassays are 
the radioimmunoassay (RIA) and the sandwich (immunometric) assay. Detection of the 
antigens using the monoclonal antibodies of the invention can be done utilizing 
immunoassays which are run in either the forward, reverse, or simultaneous modes, 
5 including immunohistochemical assays on physiological samples. Those of skill in the art 
will know, or can readily discern, other immunoassay formats without undue 
experimentation. 

The term "immunometric assay" or "sandwich immunoassay", includes 
10 simultaneous sandwich, forward sandwich and reverse sandwich immunoassays. These 
terms are well understood by those skilled in the art. Those of skill will also appreciate 
that antibodies according to the present invention will be useful in other variations and 
forms of assays which are presently known or which may be developed in the future. 
These are intended to be included within the scope of the present invention. 

15 

Monoclonal antibodies can be bound to many different carriers and used to detect 
the presence of CACNA1G . Examples of well-known carriers include glass, polystyrene, 
polypropylene, polyethylene, dextran, nylon, amylases, natural and modified celluloses, 
polyacrylamides, agaroses and magnetite. The nature of the carrier can be either soluble 
20 or insoluble for purposes of the invention. Those skilled in the art will know of other 

suitable carriers for binding monoclonal antibodies, or will be able to ascertain such using 
routine experimentation. 

In performing the assays it may be desirable to include certain "blockers" in the 
25 incubation medium (usually added with the labeled soluble antibody). The "blockers" are 
added to assure that non-specific proteins, proteases, or anti-heterophilic 
immunoglobulins to anti-CACNAlG immunoglobulins present in the experimental 
sample do not cross-link or destroy the antibodies on the solid phase support, or the 
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radiolabeled indicator antibody, to yield false positive or false negative results. The 
selection of "blockers" therefore may add substantially to the specificity of the assays 
described in the present invention. 

5 It has been found that a number of nonrelevant (i.e., nonspecific) antibodies of the 

same class or subclass (isotype) as those used in the assays (e.g., IgGl, IgG2a, IgM, etc.) 
can be used as "blockers". The concentration of the "blockers" (normally 1-100 
.mu.g/.mu.l) may be important, in order to maintain the proper sensitivity yet inhibit any 
unwanted interference by mutually occurring cross reactive proteins in the specimen. 

10 

In using a monoclonal antibody for the in vivo detection of antigen, the detectably 
labeled monoclonal antibody is given in a dose which is diagnostically effective. The 
term "diagnostically effective" means that the amount of detectably labeled monoclonal 
antibody is administered in sufficient quantity to enable detection of the site having the 

15 CACNA1G antigen for which the monoclonal antibodies are specific. The concentration 
of detectably labeled monoclonal antibody which is administered should be sufficient 
such that the binding to those cells having CACNA1G is detectable compared to the 
background. Further, it is desirable that the detectably labeled monoclonal antibody be 
rapidly cleared from the circulatory system in order to give the best target-to-background 

20 signal ratio. 

As a rule, the dosage of detectably labeled monoclonal antibody for in vivo 
diagnosis will vary depending on such factors as age, sex, and extent of disease of the 
individual. The dosage of monoclonal antibody can vary from about 0.001 mg/m.sup.2 to 
25 about 500 mg/m.sup.2, preferably 0.1 mg/m.sup.2 to about 200 mg/m.sup.2, most 

preferably about 0.1 mg/m.sup.2 to about 10 mg/m.sup.2. Such dosages may vary, for 
example, depending on whether multiple injections are given, tumor burden, and other 
factors known to those of skill in the art. 
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For in vivo diagnostic imaging, the type of detection instrument available is a 
major factor in selecting a given radioisotope. The radioisotope chosen must have al type 
of decay which is detectable for a given type of instrument. Still another important factor 

5 in selecting a radioisotope for in vivo diagnosis is that the half-life of the radioisotope be 
long enough so that it is still detectable at the time of maximum uptake by the target, but 
short enough so that deleterious radiation with respect to the host is minimized. Ideally, a 
radioisotope used for in vivo imaging will lack a particle emission, but produce a large 
number of photons in the 140-250 keV range, which may be readily detected by 

10 conventional gamma cameras. 



For in vivo diagnosis, radioisotopes may be bound to immunoglobulin either 
directly or indirectly by using an intermediate functional group. Intermediate functional 
groups which often are used to bind radioisotopes which exist as metallic ions to 
15 immunoglobulins are the bifunctional chelating agents such as 

diethylenetriaminepentactic acid (DTP A) and ethylenediaminetetraacetic acid (EDTA) 
and similar molecules. Typical examples of metallic ions which can be bound to the 
monoclonal antibodies of the invention are .sup. 11 1 In, .sup.97 Ru, .sup.67 Ga, .sup.68 
Ga, .sup.72 As, .sup.89 Zr, and .sup.201 Tl. 

20 

A monoclonal antibody useful in the method of the invention can also be labeled 
with a paramagnetic isotope for purposes of in vivo diagnosis, as in magnetic resonance 
imaging (MRI) or electron spin resonance (ESR). In general, any conventional method 
for visualizing diagnostic imaging can be utilized. Usually gamma and positron emitting 
25 radioisotopes are used for camera imaging and paramagnetic isotopes for MRI. Elements 
which are particularly useful in such techniques include .sup. 157 Gd, .sup.55 Mn, 
.sup. 162 Dy, .sup.52 Cr, and .sup.56 Fe. 



60 



The present invention also provides a method for treating a subject with a cell 
proliferative disorder associated with of CACNA1G comprising administering to a 
subject with the disorder a therapeutically effective amount of reagent which modulates 
CACNA1G expression. In brain, breast and renal cancer cells, for example, the 
5 CACNA1 G nucleotide sequence is under-expressed as compared to expression in a 
normal cell, therefore, it is possible to design appropriate therapeutic or diagnostic 
techniques directed to this sequence. Thus, where a cell-proliferative disorder is 
associated with the expression of CACNA1G associated with malignancy, nucleic acid 
sequences that modulate CACNA1G expression at the transcriptional or translational 
10 level can be used. In cases when a cell proliferative disorder or abnormal cell phenotype 
is associated with the under expression of CACNA1G , for example, nucleic acid 
sequences encoding CACNA1G (sense) could be administered to the subject with the 
disorder. 

15 The term "cell-proliferative disorder" denotes malignant as well as non-malignant 

cell populations which often appear to differ from the surrounding tissue both 
morphologically and genotypically. Such disorders may be associated, for example, with 
absence of expression of CACNA1G . Essentially, any disorder which is etiologically 
linked to expression of CACNA1G could be considered susceptible to treatment with a 

20 reagent of the invention which modulates CACNA1 G expression. 

The term "modulate" envisions the suppression of mefhylation of CACNA1G 
polynucleotide when CACNA1G is under-expressed. When a cell proliferative disorder 
is associated with CACNA1G expression, such methylation suppressive reagents as 5- 
25 azacytadine can be introduced to a cell. Alternatively, when a cell proliferative disorder is 
associated with under-expression of CACNA1G polypeptide, a sense polynucleotide 
sequence (the DNA coding strand) encoding CACNA1G polypeptide, or 5' regulatory 
nucleotide sequences (i.e., promoter) of CACNA1G in operable linkage with CACNA1G 
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polynucleotide can be introduced into the cell. Demethylases known in the art could also 
be used to remove methylation. 

The present invention also provides gene therapy for the treatment of cell 
5 proliferative disorders which are mediated by CACNA1G . Such therapy would achieve 
its therapeutic effect by introduction of the appropriate CACNA1G polynucleotide which 
contains a CACNA1G structural gene (sense), into cells of subjects having the 
proliferative disorder. Delivery of sense CACNA1G polynucleotide constructs can be 
achieved using a recombinant expression vector such as a chimeric virus or a colloidal 
10 dispersion system. 

The polynucleotide sequences used in the method of the invention may be the 
native, unmethylated sequence or, alternatively, may be a sequence in which a 
nonmethylatable analog is substituted within the sequence. Preferably, the analog is a 
15 nonmethylatable analog of cytidine, such as 5-azacytadine. Other analogs will be known 
to those of skill in the art. Alternatively, such nonmethylatable analogs could be 
administered to a subject as drug therapy, alone or simultaneously with a sense structural 
gene for CACNA1G or sense promoter for CACNA1G operably linked to CACNA1G 
structural gene. 

20 

In another embodiment, a CACNA1G structural gene is operably linked to a 
tissue specific heterologous promoter and used for gene therapy. For example, a 
CACNA1G gene can be ligated to prostate specific antigen (PSA)-prostate specific 
promoter for expression of CACNA1G in prostate tissue. Other tissue specific promoters 
25 will be known to those of skill in the art. Alternatively, the promoter for a tumor 
suppressor gene can be linked to the CACNA1G structural gene and used for gene 
therapy. 
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Various viral vectors which can be utilized for gene therapy as taught herein 
include adenovirus, herpes virus, vaccinia, or, preferably, an RNA virus such as a 
retrovirus. Preferably, the retroviral vector is a derivative of a murine or avian retrovirus. 
Examples of retroviral vectors in which a single foreign gene can be inserted include, but 
5 are not limited to: Moloney murine leukemia virus (MoMuLV), Harvey murine sarcoma 
virus (HaMuSV), murine mammary tumor virus (MuMTV), and Rous Sarcoma Virus 
(RSV). Most preferably, a non-human primate retroviral vector is employed, such as the 
gibbon ape leukemia virus (GaLV), thereby providing a broader host range than murine 
vectors, for example. 

10 

A number of additional retroviral vectors can incorporate multiple genes. All of 
these vectors can transfer or incorporate a gene for a selectable marker so that transduced 
cells can be identified and generated. Retroviral vectors can be made target specific by 
inserting, for example, a polynucleotide encoding a sugar, a glycolipid, or a protein. 
15 Preferred targeting is accomplished by using an antibody to target the retroviral vector. 
Those of skill in the art will know of, or can readily ascertain without undue 
experimentation, specific polynucleotide sequences which can be inserted into the 
retroviral genome to allow target specific delivery of the retroviral vector containing the 
CACNA1G sense or antisense polynucleotide. 

20 

Since recombinant retroviruses are defective, they require assistance in order to 
produce infectious vector particles. This assistance can be provided, for example, by 
using helper cell lines that contain plasmids encoding all of the structural genes of the 
retrovirus under the control of regulatory sequences within the LTR. These plasmids are 
25 missing a nucleotide sequence which enables the packaging mechanism to recognize an 
RNA transcript for encapsidation. Helper cell lines which have deletions of the packaging 
signal include but are not limited to .PSI.2, PA317 and PA 12, for example. These cell 
lines produce empty virions, since no genome is packaged. If a retroviral vector is 
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introduced into such cells in which the packaging signal is intact, but the structural genes 
are replaced by other genes of interest, the vector can be packaged and vector virion 
produced. 

5 Another targeted delivery system for CACNA1G polynucleotide is a colloidal 

dispersion system. Colloidal dispersion systems include macromolecule complexes, 
nanocapsules, microspheres, beads, and lipid-based systems including oil-in-water 
emulsions, micelles, mixed micelles, and liposomes. The preferred colloidal system of 
this invention is a liposome. Liposomes are artificial membrane vesicles which are useful 

10 as delivery vehicles in vitro and in vivo. It has been shown that large unilamellar vesicles 
(LUV), which range in size from 0.2-4.0 urn can encapsulate a substantial percentage of 
an aqueous buffer containing large macromolecules. RNA, DNA and intact virions can be 
encapsulated within the aqueous interior and be delivered to cells in a biologically active 
form (Fraley, et al., Trends Biochem. Sci., 6:77, 1981). In addition to mammalian cells, 

15 liposomes have been used for delivery of polynucleotides in plant, yeast and bacterial 
cells. In order for a liposome to be an efficient gene transfer vehicle, the following 
characteristics should be present: (1) encapsulation of the genes of interest at high 
efficiency while not compromising their biological activity; (2) preferential and 
substantial binding to a target cell in comparison to non-target cells; (3) delivery of the 

20 aqueous contents of the vesicle to the target cell cytoplasm at high efficiency; and (4) 
accurate and effective expression of genetic information (Mannino, et al., Biotechniques, 
6:682, 1988). 

The composition of the liposome is usually a combination of phospholipids, 
25 particularly high-phase-transition-temperature phospholipids, usually in combination with 
steroids, especially cholesterol. Other phospholipids or other lipids may also be used. The 
physical characteristics of liposomes depend on pH, ionic strength, and the presence of 
divalent cations. 
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Examples of lipids useful in liposome production include phosphatidyl 
compounds, such as phosphatidylglycerol, phosphatidylcholine, phosphatidylserine, 
phosphatidylethanolamine, sphingolipids, cerebrosides, and gangliosides. Particularly 
5 useful are diacylphosphatidylglycerols, where the lipid moiety contains from 14-18 
carbon atoms, particularly from 16-18 carbon atoms, and is saturated. Illustrative 
phospholipids include egg phosphatidylcholine, dipalmitoylphosphatidylcholine and 
distearoylphosphatidylcholine. 

1 0 The targeting of liposomes has been classified based on anatomical and 

mechanistic factors. Anatomical classification is based on the level of selectivity, for 
example, organ-specific, cell-specific, and organelle-specific. Mechanistic targeting can 
be distinguished based upon whether it is passive or active. Passive targeting utilizes the 
natural tendency of liposomes to distribute to cells of the reticulo-endothelial system 

15 (RES) in organs which contain sinusoidal capillaries. Active targeting, on the other hand, 
involves alteration of the liposome by coupling the liposome to a specific ligand such as a 
monoclonal antibody, sugar, glycolipid, or protein, or by changing the composition or 
size of the liposome in order to achieve targeting to organs and cell types other than the 
naturally occurring sites of localization. 

20 

The surface of the targeted delivery system may be modified in a variety of ways. 
In the case of a liposomal targeted delivery system, lipid groups can be incorporated into 
the lipid bilayer of the liposome in order to maintain the targeting ligand in stable 
association with the liposomal bilayer. Various linking groups can be used for joining the 
25 lipid chains to the targeting ligand. 

In general, the compounds bound to the surface of the targeted delivery system 
will be ligands and receptors which will allow the targeted delivery system to find and 
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"home in" on the desired cells. A ligand may be any compound of interest which will 
bind to another compound, such as a receptor. 

In general, surface membrane proteins which bind to specific effector molecules are 
5 referred to as receptors. In the present invention, antibodies are preferred receptors. 

Antibodies can be used to target liposomes to specific cell-surface ligands. For example, 
certain antigens expressed specifically on tumor cells, referred to as tumor-associated 
antigens (TAAs), may be exploited for the purpose of targeting CACNA1G antibody- 
containing liposomes directly to the malignant tumor. Since the CACNA1G gene product 

10 may be indiscriminate with respect to cell type in its action, a targeted delivery system 
offers a significant improvement over randomly injecting non-specific liposomes. 
Preferably, the target tissue is human brain, colon, breast, lung, and renal origin. A 
number of procedures can be used to covalently attach either polyclonal or monoclonal 
antibodies to a liposome bilayer. Antibody-targeted liposomes can include monoclonal or 

15 polyclonal antibodies or fragments thereof such as Fab, or F(ab').sub.2, as long as they 
bind efficiently to an antigenic epitope on the target cells. Liposomes may also be 
targeted to cells expressing receptors for hormones or other serum factors. 

It should be noted that as used herein and in the appended claims, the singular 
20 forms "a," "and," and "the" include plural referents unless the context clearly dictates 
otherwise. Thus, for example, reference to "a cell" includes a plurality of such cells and 
reference to "the restriction enzyme" includes reference to one or more restriction 
enzymes and equivalents thereof known to those skilled in the art, and so forth. 

25 Unless defined otherwise, all technical and scientific terms used herein have the 

same meaning as commonly understood to one of ordinary skill in the art to which this 
invention belongs. Although any methods, devices and materials similar or equivalent to 
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those described herein can be used in the practice or testing of the invention, the preferred 
methods, devices and materials are now described. 

All publications mentioned herein are incorporated herein by reference in full for 
5 the purpose of describing and disclosing the methodologies which are described in the 
publications which might be used in connection with the presently described invention. 
The publications discussed above and throughout the text are provided solely for their 
disclosure prior to the filing date of the present application. Nothing herein is to be 
construed as an admission that the inventors are not entitled to antedate such disclosure 
10 by virtue of prior invention. 

The following example is intended to illustrate but not to limit the invention in 
any manner, shape, or form, either explicitly or implicitly. While they are typical of 
those that might be used, other procedures, methodologies, or techniques known to those 
15 skilled in the art may alternatively be used. 

EXAMPLE 1 

IDENTIFICATION OF CACNA1G AS A TARGET FOR 
HYPERMETHYLATION O N HITMAN CHROMOSOME 17a21 

20 

In order to isolate genes differentially methylated in cancer cells as opposed to 
non-cancerous cells the following experimental protocols were used. An example of the 
results obtained is provided hereinabove in the description of the isolation and 
characterization of human CANNA1G. 



Tissue Samples and Cell Lines 

Forty-nine primary colorectal cancers, 28 colorectal adenomas, 16 primary gastric 
cancers and 17 acute myelogenous leukemia samples were used for methylation analyses. 
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DNA from eight colon cancer cell lines (Caco2, RKO, SW48, HCT116, DLD1, Lovo, 
SW837, HT29), 4 lung cancer cell lines (OH3, H249, HI 57, H209), 4 glioblastoma cell 
fines (Dauy, D283, U87, U373), 8 breast cancer cell lines (MB-468, MCF7, MB-231, 
MB-474, MB-435, MB-453, BT20, CAMA1, SKBR3), 7 hematopoietic tumor cell lines 

5 (CEM, Raji, KG1A, HL60, ML-l,MoIt3, K562), and 4 prostate cancer cell lines (DU145, 
DUPRO, LNCAP, TSUPRL) were also investigated. DNA was extracted by standard 
procedures. RNA was isolated from cell lines and adenomas using TRIZOL (GIBCO- 
BRL). For re-expression analysis, cell lines were treated with 5-Aza-deoxycytidine 
(SIGMA) at a final concentration of 1 M for 6 days. All tissue samples were obtained 

10 from patients who gave informed consent according to institutional guidelines. 

RT-PCR. 

Six g of total RNA, was reverse transcribed using the SUPERSCRIPT kit 
(GIBCO-BRL) for first strand cDNA synthesis. One hundred ng of cDNA was used as 

1 5 template for RT-PCR reactions. To design the RT-PCR primers, Blast search was 
performed using the rat CacnalG cDNA sequence (Genbank AF027984) reported 
previously (25) and exon-intron boundaries of the human CACNA1G were predicted by 
this analysis. Each primer set was designed to amplify the cDNA across several introns. 
Primer sequences and PCR conditions are available, at 

20 http://www.med.jhu.edu/methylation/primers. GAPDH was also amplified as a control 
using primers GAPDHF: 5 ' -CGGAGTC AACGGATTGGTCGTAT-3 ' (SEQ ID NO:55) 
and GAPDHR: 5 ' - AGCCTTCTCC ATGGTGGTGAAGAC-3 ' (SEQ ID NO:56). All 
reactions were performed with RT(-) controls. PCR amplification was performed for 35 
cycles of 95° C 30 sec, 60-65°C for 30 sec, 72°C for 30 sec, and the products were 

25 analyzed by agarose gel electrophoresis. 



DNA Sequencing and Data Analysis. 
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PCR reaction products were precipitated with ethanol, resuspended in diluted 
water and cloned into the pCR2.1 vector using the TA cloning kit (Invitrogen) according 
to the manufacturer's instruction. After transformation, plasmid DNA was purified using 
the Wizard Miniprep Kit (Promega). DNA sequence analysis was carried out at the Johns 
5 Hopkins University Sequence Facility using automated DNA sequencers (Applied 

Biosy stems). Sequence homology was identified by the BLAST program of the National 
Center for Biological Information (NCBI) available at http://www.ncbi.nlm.nih.gov/ 
BLAST/. An IMAGE cDNA clone (Genbank: HI 33 33) was identified by BLAST 
analysis using the sequence of BAC AC004590 (Genbank) which includes MINT31. 

10 Putative genes (Gl and G2) were identified by GENSCAN (available at http://ccr- 

081.mit.edu/GENSCANMIT.html) using the BAC sequence data. IMAGE cDNA clone 
HI 333) was then obtained from the American Type Culture Collection and completely 
sequenced. Potential transcription factor binding sites and promoter prediction were 
examined using the TESS and TSSG programs respectively, available at the Baylor 

15 College of Medicine BCM Launcher (http://kiwi.imgen.bcm.tmc.edu:8088/search 

launcher/launcher.html/). The nucleotide sequence of part of the 5' end of the cDNA of 
CACNA1G has been submitted to Genbank. 



Bisulfite-PCR Methylation Analysis. 

20 Bisulfite treatment was performed as reports previously. (Herman, J. G., Graff, J. 

R., Myohanen, S., Nelkin. B. D. and Baylin, S.B. Proc. Natl. Acad. Sci. USA. 93:9821- 
9826, 1996). Briefly, 2 g of genomic DNA was denatured with 2 M NaOH for 10 min., 
followed by incubation with 3 M Na-bisulfite, pH 5.0 for 16 hours at 50°C. After 
treatment DNA was purified using a Wizard Miniprep Column (Promega), precipitated 

25 with ethanol and resuspended in 20 1 of diluted water. Two 1 of the aliquot was used as 
template for each PCR reaction. Semi-quantitative bisulfite-PCR was performed 
essentially as described. Xiong, Z. and Laird, P. W. COBRA: a sensitive and quantitative 
DNA methylation assay. Nucleic Acids Res. 25:2532-2534, 1997. To avoid 
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overestimation of the methylated alleles, the following points were considered. First, 
primers were designed to contain a minimum number of CpG dinucleotides in the 
sequence to avoid the biased amplification of methylated alleles. If primers do contain 
CpG sites, they were designed to amplify methylated and unmethylated alleles equally 
5 (using a mixture of C or T for sense and a mixture of G or A for antisense primers). 
Second, the primers were designed to contain a maximum number of thymidines 
converted from cytosines to avoid amplifying the non-converted genomic sequence. 
Third, restriction sites which only appear after bisulfite conversion (e.g. ACGC to 
ACGT) were used (regions 1-8). PCR was performed as described previously (Herman, 

10 supra). Primer sequences, annealing temperature and PCR cycles are available at 

http://www.jhu.edu/methylation/primers. Twenty % of the PCR products were digested 
with the appropriate restriction enzymes, precipitated with ethanol and separated by 5% 
polysacrylamide gel electrophoresis. Gels were stained with ethidium bromide, and the 
intensity of each allele was calculated by densitometry, using the Image Quant software 

15 (Molecular Dynamics). 



Although the invention has been described with reference to the presently 
preferred embodiment, it should be understood that various modifications can be made 
without departing from the spirit of the invention. Accordingly, the invention is limited 
20 only by the following claims. 
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What is claimed is: 

1 . An isolated nucleic acid molecule comprising the coding region for a T-type calcium 
channel and regulatory sequences associated therewith. 

2. The nucleic acid molecule of claim 1, wherein said associated regulatory sequences 
contain CpG-rich regions. 

3 . The nucleic acid molecule of claim 2, wherein the state of methylation of the CpG- 
rich regions is determinative of the presence of a cellular proliferative disorder in a 
subject from which the nucleic acid molecule is isolated. 

4. The nucleic acid molecule of claim 2, wherein hypermethylation of said CpG islands 
is indicative of the presence of a cellular proliferative disorder in a subject from 
which said nucleic acid is isolated. 

5. The nucleic acid molecule of claim 1 , wherein said T-type calcium channel is 
CACNA1G. 

6. The nucleic acid molecule of claim 5, wherein said nucleic acid comprises the nucleic 
acid sequence of SEQ ID NO: 51 . 

7. The nucleic acid molecule of claim 6, wherein one or more of regions 1-8 comprises 
methylated bases. 



8. A substantially purified polypeptide encoded by the polynucleotide of SEQ ID 
NO.51. 
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9. The polypeptide of claim 8, wherein the polypeptide has an amino acid sequence as 
set forth in SEQ ID NO:52. 

10. A method for detecting a cellular proliferative disorder in a subject comprising: 

a) contacting a nucleic acid-containing specimen from the subject with an agent that 
provides a determination of the methylation state of at least one gene or associated 
regulatory region of the gene; and 

b) identifying aberrant methylation of regions of the gene or regulatory region, 
wherein aberrant methylation is identified as being different when compared to 
the same regions of the gene or associated regulatory region in a subject not 
having said cellular proliferative, thereby detecting a cellular proliferative 
disorder in the subject. 

1 1 . The method of claim 10, wherein the regions of said gene are contained within CpG 
rich regions. 

12. The method of claim 10, wherein the gene is selected from the group consisting of 
APOB, CACNA1G, CDX2, EGFR, FBN1, GPR37, HSPA6, IQGAP2, KL, PAR2, 
PITX2, PTCH, SDC1, SDC4 and combinations thereof. 

13. The method of claim 10, wherein aberrant methylation comprises hypermethylation 
when compared to the same regions of the gene or associated regulatory regions in a 
subject not having the cellular proliferative disorder. 

14. The method of claim 13, wherein the regions comprise regulatory regions of 
CACNA1G. 

15. The method of claim 14, wherein the regions comprise regions 1-8 of CACNA1G. 
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16. The method of claim 15, wherein the regions comprise regions 1-2 of CACNA1G. 

17. The method of claim 15, wherein the regions comprise regions 5-7 of CACNA1G. 

18. The method of claim 15, wherein the regions comprise regions 4 and 8 of 
CACNA1G. 

19. The method of claim 10, wherein the agent is a pair of primers that hybridize with a 
target sequence in the gene or associated regulatory region of the gene. 

20. The method of claim 19, wherein the primers hybridize with a target polynucleotide 
sequence having the sequence selected from the group consisting of SEQ ID NO: 55- 
103 and SEQ ID NO: 104. 

21. The method of claim 20, wherein the primers are in consecutive pairs selected from 
the group consisting of SEQ ID NO:l-49 and SEQ ID NO:50. 

22. The method of claim 10, wherein the nucleic acid-containing specimen comprises a 
tissue selected from the group consisting of brain, colon, urogenital, lung, renal, 
prostate, pancreas, liver, esophagus, stomach, hematopoietic, breast, thymus, testis, 
ovarian, and uterine. 

23. The method of claim 10, wherein the nucleic acid-containing specimen is selected 
from the group consisting of serum, urine, saliva, blood, cerebrospinal fluid, pleural 
fluid, ascites fluid, sputum, stool, and biopsy sample. 
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24. The method of claim 10, wherein said cellular proliferative disorder is selected from 
the group consisting of low grade astrocytoma, anaplastic astrocytoma, glioblastoma, 
medulloblastoma, gastric cancer, colorectal cancer, colorectal adenoma, acute 
myelogenous leukemia, lung cancer, renal cancer, leukemia, breast cancer, prostate 
cancer, endometrial cancer and neuroblastoma. 

25. A kit useful for the detection of a cellular proliferative disorder in a subject 
comprising: 

a) carrier means compartmentalized to receive a sample therein; 

b) one or more containers comprising a first container containing a reagent which 
modifies unmethylated cytosine and a second container containing primers for 
amplification of a CpG-containing nucleic acid, wherein the primer hybridizes 
with a target polynucleotide sequence having the sequence selected from the 
group consisting of SEQ ID NO:55-103 and SEQ ID NO:104. 

26. The kit of claim 25, further comprising a third container containing a methylation 
sensitive restriction endonuclease. 

27. The kit of claim 25, wherein said modifying reagent is bisulfite. 

28. The kit of claim 25, wherein the primers are selected from the group consisting of 
SEQ ID NO: 1-49 and SEQ ID NO: 50. 

29. Isolated oligonucleotide primer(s) for detection of a methylated CpG-containing 
nucleic acid wherein the primer hybridizes with a target polynucleotide sequence 
having the sequence selected from the group consisting of SEQ ID NO:55-103 and 
SEQ ID NO: 104. 
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30. The primers of claim 29, wherein said primers are selected from the group consisting 
of SEQ ID NO: 1-49 and SEQ ID NO: 50. 

31. An isolated nucleic acid molecule having at least one methylated Cytosine of a CpG 
dinucleotide in a CpG-rich region and encoding a gene selected from the group consisting 
of APOB, CACNA1G, CDX2, EGFR, FBN1, GPR37, HSPA6, IQGAP2, KL, PAR2, 
PITX2, PTCH, SDC1, and SDC4. 

32. The nucleic acid molecule of claim 31, wherein the methylated C residue of a CpG 
dinucleotide is located within a CpG-rich region selected from the group consisting of 
SEQ ID NO: 105-1 18 and SEQ ID NO: 119. 
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ABSTRACT OF THE INVENTION 



A novel T-type calcium channel (CACNA1G) is provided, as are polynucleotides 
encoding the same. CACNA1G has been implicated in cellular proliferative disorders. 
More specifically, it has been observed that the methylation state of specific regions 
within CpG islands associated with the CACNA1G gene correlates with a number of 
cancerous phenotypes involving a variety of tissue and cell types. Also provided are 
methods for detecting cellular proliferative disorders by deteimining the methylation state 
of genes or regulatory regions associated therewith, including CACNA1G, as well as kits 
containing reagents for performing invention methods. 



Figure 1A 
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FIGURE 3A (SEQ ID NO: 51) CACNA1G nucleotide sequence 



CCTTTTCGTTCGCCCTCTCGGGGCGGCTTCGCCGAAGGTAGCGCCGAATCCGGCAACCGGAGCCTGGGCGCGAAGCGAAG 
AAGCCGGAACAAAGTGAGGGGGAGCCGGCCGGCTGGCCCGGGAAGCCCCAGGGGCGCAGGGGAAGCGGGACTCGCGCCGG 
GCGGGGTTTCCCTGCGCCCCGGCGCCCCGCGGGCAGCATGCCCCTGCGGGCAGGGGGAGCTGGGCTGAACTGGCCCTCCC 
GGGGGCTCAGCTTGCGCCCTAGAGCCCACCAGATGTGCCCCCGCCGGGGCCCCCGGGTTGCGTGAGGACACCTCCTCTGA 
GGGGCGCCGCTTGCCCCTCTCCGGATCGCCCGGGGCCCCGGCTGGCCAGAGGATGGACGAGGAGGAGGATGGAGCGGGCG 
CCGAGGAGTCGGGACAGCCCCGGAGCTTCATGCGGCTCAACGACCTGTCGGGGGCCGGGGGCCGGCCGGGGCCGGGGTCA 

CTTGAGCCAGGACAGCCGCCCGCGGAGCTGGTGTCTCCGCACGGTCTGTAACCCCTGGTTTGAGCGCATCAGCATGTTGG 
TCATCCTTCTCAACTGCGTGACCCTGGGCATGTTCCGGCCATGCGAGGACATCGCCTGTGACTCCCAGCGCTGCCGGATC 
CTGCAGGCCTTTGATGACTTCATCTTTGCCTTCTTTGCCGTGGAGATGGTGGTGAAGATGGTGGCCTTGGGCATCTTTGG 
GAAAAAGTGTTACCTGGGAGACACTTGGAACCGGCTTGACTTTTTCATCGTCATCGCAGGGATGCTGGAGTACTCGCTGG 
ACCTGCAGAACGTCAGCTTCTCAGCTGTCAGGACAGTCCGTGTGCTGCGACCGCTCAGGGCCATTAACCGGGTGCCCAGC 
ATGCGCATCCTTGTCACGTTGCTGCTGGATACGCTGCCCATGCTGGGCAACGTCCTGCTGCTCTGCTTCTTCGTCTTCTT 
CATCTTCGGCAT CGTCGGC6TCCAGC TGTGGGCAGGGCTGCTTCGGAACCGATGC TTC CTACCTGAGAATTTCAGC CTC C 
CCCTGAGCGTGGACCTGGAGCGCTATTACCAGACAGAGAACGAGGATGAGAGCCCCTTCATCTGCTCCCAGCCACGCGAG 
AACGGCATGCGGTCCTGCAGAAGCGTGCCCACGCTGCGCGGGGACGGGGGCGGTGGCCCACCTTGCGGTCTGGACTATGA 
GGCCTACAACAGCTCCAGCAAGACCACCTGTGTCAACTGGAACCAGTACTACACGAACTGCTC^GCGGGGGAGCACAACC 
CCTTCAAGGGCGCCATCAACTTTGACAACATTGGCTATGCCTGGATCGCCATCTTCCAGGTCATCACGCTGGAGGGCTGG 
GTCGACATCATGTACTTTGTGATGGATGCTCATTCCTTCTACAATTTCATCTACTTCATCCTCCTCATCATCGTGGGCTC 
CTTCTTCATGATCAACCTGTGCCTGGTGGTGATTGCCACGCAGTTCTCAGAGACCAAGCAGCGGGAAAGCCAGCTGATGC 
GGGAGCAGCGTGTGCGGTTCCTGTCCAACGCCAGCACCCTGGCTAGCTTCTCTGAGCCCGGCAGCTGCTATGAGGAGCTG 
CTCAAGTACCTGGTGTACATCCTTCGTAAGGCAGCCCGCAGGCTGGCTCAGGTCTCTCGGGCAGCAGGTGTGCGGGTTGG 
GCTGCTCAGCAGCCCAGCACCCCTCGGGGGCCAGGAGACCCAGCCCAGCAGCAGCTGCTCTCGCTCCCACCGCCGCCTAT 
CCGTCCACCACCTGGTGCACCACCACCACCACCATCACCACCACTACCACCTGGGCAATGGGACGCTCAGGGCCCCCCGG 
GCCAGCCCGGAGATCCAGGACAGGGATGCCAATGGGTCCCGCCGGCTCATGCTGCCACCACCCTCGACGCCTGCCCTCTC 
CGGGGCCCCCCCTGGTGGCGCAGAGTCTGTGCACAGCTTCTACCATGCCGACTGCCACTTAGAGCCAGTCCGCTGCCAGG 
CGCCCCCTCCCAGGTCCCCATCTGAGGCATCCGGCAGGACTGTGGGCAGCGGGAAGGTGTATCCCACCGTGCACACCAGC 
CCTCCACCGGAGACGCTGAAGGAGAAGGCACTAGTAGAGGTGGCTGCCAGCTCTGGGCCCCCAACCCTCACCAGCCTCAA 
CATCCCACCCGGGCCCTACAGCTCCATGCACAAGCTGCTGGAGACACAGAGTACAGGTGCCTGCCAAAGCTCTTGCAAGA 
TCTCCAGCCCTTGCTTGAAAGCAGACAGTGGAGCCTGTGGTCCAGACAGCTGCCCCTACTGTGCCCGGGCCGGGGCAGGG 
GAGGTGGAGCTCGCCGACCGTGAAATGCCTGACTCAGACAGCGAGGCAGTTTATGAGTTCACACAGGATGCCCAGCACAG 
CGACCTCCGGGACCCCCACAGCCGGCGGCAACGGAGCCTGGGCCCAGATGCAGAGCCCAGCTCTGTGCTGGCCTTCTGGA 
GGCTAATCTGTGACACCTTCCGAAAGATTGTGGACAGCAAGTACTTTGGCCGGGGAATCATGATCGCCATCCTGGTCAAC 
ACACTCAGCATGGGCATCGAATACCACGAGCAGCCCGAGGAGCTTACCAACGCCCTAGAAATCAGCAACATCGTCTTCAC 
CAGCCTCTTTGCCCTGGAGATGCTGCTGAAGCTGCTTGTGTATGGTCCCTTTGGCTACATCAAGAATCCCTACAACATCT 
TCGATGGTGTCATTGTGGTCATCAGCGTGTGGGAGATCGTGGGCCAGCAGGGGGGCGGCCTGTCGGTGCTGCGGACCTTC 
CGCCTGATGCGTGTGCTGAAGCTGGTGCGCTTCCTGCCGGCGCTGCAGCGGCAGCTGGTGGTGCTCATGAAGACCATGGA 
CAACGTGGCCACCTTCTGCATGCTGCTTATGCTCTTCATCTTCATCTTCAGCATCCTGGGCATGCATCTCTTCGGCTGCA 
AGTTTGCCTCTGAGCGGGATGGGGACACCCTGCCAGACCGGAAGAATTTTGACTCCTTGCTCTGGGCCATCGTCACTGTC 
TTTCAGATCCTGACCCAGGAGGACTGGAACAAAGTCCTCTACAATGGTATGGCCTCCACGTCGTCCTGGGCGGCCCTTTA 
TTTCATTGCCCTCATGACCTTCGGCAACTACGTGCTCTTCAATTTGCTGGTCGCCATTCTGGTGGAGGGCTTCCAGGCGG 
AGGGAGATGCCAACAAGTCCGAATCAGAGCCCGATTTCTTCTCACCCAGCCTGGATGGTGATGGGGACAGGAAGAAGTGC 
TTGGCCTTGGTGTCCCTGGGAGAGCACCCGGAGCTGCGGAAGAGCCTGCTGCCGCCTCTCATCATCCACACGGCCGCCAC 
ACCCATGTCGCTGCCCAAGAGCACCAGCACGGGCCTGGGCGAGGCGCTGGGCCCTGCGTCGCGCCGCACCAGCAGCAGCG 
GGTCGGCAGAGCCTGGGGCGGCCCACGAGATGAAGTCACCGCCCAGCGCCCGCAGCTCTCCGCACAGCCCCTGGAGCGCT 
GCAAGCAGCTGGACCAGCAGGCGCTCCAGCCGGAACAGCCTCGGCCGTGCACCCAGCCTGAAGCGGAGAAGCCCAAGTGG 
AGAGCGGCGGTCCCTGTTGTCGGGAGAAGGCCAGGAGAGCCAGGATGAAGAGGAGAGCTCAGAAGAGGAGCGGGCCAGCC 
CTGCGGGCAGTGACCATCGCCACAGGGGGTCCCTGGAGCGGGAGGCCAAGAGTTCCTTTGACCTGCCAGACACACTGCAG 
GTGCCAGGGCTGCATCGCACTGCCAGTGGCCGAGGGTCTGCTTCTGAGCACCAGGACTGCAATGGCAAGTCGGCTTCAGG 
GCGCCTGGCCCGGGCCCTGCGGCCTGATGACCCCCCACTGGATGGGGATGACGCCGATGACGAGGGCAACCTG 



Figure 3B (SEQ ID NO: 52) 



MDEEEDGAGAEESGQPRSFMKLNDLSGAGGRPGPGSAEKDPGSA 
DSEAEGLPYPALAPVVFFYIjSQDSRPRSWCLRTVCNPWFERISMLVILLNCVTLGMFR 
P CED I ACDS QRCRI LQAFDDF I FAFFAVEMWKMVALG I FGKKCYLGDTWNRLD FFI V 
IAGMLEYSLDLQNVSFSAVRTVRVLRPLRAINRVPSMRILVTLLLDTLPMLGNVLLLC 
FFVFFIFGIVGVOLWAGLLRNRCFLPENFSLPLSVDLERYYQTENEDESPFICSQPRE 
NGMRSC^SVPTLRGDGGGGPPCGLDYEAYNSSSNTTCYNWNQYYTNCSAGEHNPFKGA 
INFDNIGYAWIAI FQVITLEGWVDIMYFVMDAHSFYNFI YFILLI IVGSFFMINLCLV 
VIATQFSETKQRESQLMREQRVRFLSNASTLASFSEPGSCYEELLKYLVYILRKAARR 
LAQVSRAAGTOVGLLSSPAPLGGQETQPSSSCSRSHRRLSVHHLVHHHHHHHHHYHLG 
KGTLRAPRASPEIQDRDANGSRRLMLPPPSTPALSGAPPGGAESVHSFYHADCHLEPV 
RCQAPPPRSPSEASGRWGSGKVYPTVHTSPPPETIJCEKALVEYAASSGPPTLTSLNI 
PPGPYSSMHKLLETQSTGACQSSCKISSPCLKADSGACGPDSCPYCARAGAGEVELAD 
REMPDSDSEAVYEFTQDAQHSDLRDPHSRRQRSLGPDAEPSSVLAFWRLICDTFRKIV 
DSKYFGRGIMIAILVNTLSMGIEYHEQPEELTNALEISNIVFTSLFALEMLLKLLVYG 
PFGYI KNPYNI FDGVIWI SVWE I VGQQGGGLSVLRTFRLMRVLKLVRFLPALQRQLV 
VLMKTMDJWATFCMLLKLFIFIFSiriGMHLFGCKFASERDGDTLPDRKNFDSLIjWAIV 
TVFQILTQEDWNKVLYNGMASTSSWAALYFIALMTFGNYVLFNLLVAILVEGFQAEGD 
ANKSESEPDFFSPSLDGDGDRKKCLALVSLGEHPELRKSLLPPL I IHTAATPMSLPKS 
TSTGLGEALGPASRRTSSSGSAEPGAAHEMKSPPSARSSPHSPWSAASSWTSRRSSRN 
SLGRAPSLKRRSPSGERRSLLSGEGQESQDEEESSEEERASPAGSDHRHRGSLEREAK 
SSFDLPDTLQVPGLHRTASGRGSASEHQDCNGKSASGRLARALRPDDPPLDGDDADDE 
GNL 



FIGURE 4A 



APOB CpG ISLAND (SEQ ID NO: 105) 

cccgggaggcgccctttggaccttttgcaatcctggcgctcttgcagcctgggcttcctataaatggggtgcgggcgccggccgcgcattc 
ccaccgggacctgcggggctgagtgcccttctcggttgctgccgctgaggagcccgcccagccagccagggccgcgaggccgaggcc 
aggccgcagcccaggagccgccccaccgcagctggcgatggacccgccgaggcccgcgctgctggcgctgcctgcgctgctgctgct 
gctgctggcgggcgccagggccggtgagtgcgcggccgctctgcgggcagcagagggagcgggagggagccggcggaccgaggtt 
ggccggggcagcctgggcctaggccagagggagggcagccacagggtccagggcgagtggggggattggaccagctggcggcccc 
tgcaggctcaggatggggggcgcgggatggaggggctgaggagggggtctccggagcctgcctccctcctgaaaggtgaaacctgtgc 
cggtggtccccctgtcgggcccctagcacccgctgggaagacgtgggaagctc 

CACNA1G CpG ISLAND (SEQ ID NO: 106) 

cctgcggccctacgccaggaccccgcgccgaatactctgattcttcgggctccctccaagggagtcccaaagaccccaatggccaatagg 

aaagtgggttcggtctgggcagcagtctgattggctccagccttcgggagcggacccaggggcaaggggaggggagaggggcggtcct 

gggttttggggtgggaatcggattccagctgtggttctctccctgcgctcccgcccgcactgccacggcggacggccaatgggcgcgcgg 

ctcggggccggcggcgtccggcgattggctgcggggctgtctgggggcggggccgaggcttgaagttgaagtgagggatccagctgtg 

gtgtgcgcggggctcctcgccgccgctttcgctcgctcgctccgcgtctcggccggaggaggaggctgtggcgccggcgacagctacg 

gcagcggcagccaccgcggcggctgcggcggcggcatctccgcctccactcccgcccgggactgccccccactgtctccccgcccctc 

ccggacagtgagcccgcggcggggcgggggaaggagccgcccccaccccctccaagcccacccctaaagagatccctcctcccctcc 

cccgccgcctggcgcggagccgggacgatgctgaccccttagatccggctccagctgcgccgcgggaagagggggcgcccctccccg 

gacccccgccctccgccgctgccccccttttcgttcgccctctcggggcggcttcgccgaaggtagcgccgaatccggcaaccggagcct 

gggcgcgaagcgaagaagccggaacaaagtgagggggagccggccggctggcccgggaagccccaggggcgcaggggaagcgg 

gactcgcgccgggcggggtttccctgcgccccggcgccccgcgggcagcatgcccctgcgggcagggggagctgggctgaactggc 

cctcccgggggctcagcttgcgccctagagcccaccagatgtgcccccgccggggcccccgggttgcgtgaggacacctcctctgagg 

Ggcgccgcttgcccctctccggatcgcccggggccccggctggccagaggatggacgaggaggaggatggagcgggcgccgagga 

gtcgggacagccccggagcttcatgcggctcaacgacctgtcgggggccgggggcggccggggccggggtcagcagaaaaggaccc 

gggcagcgcggactccgaggcggaggggctgccgtacccggcgctggccccggtggttttcttctacttgagccaggacagccgcccg 

cggagctggTgtctccgcacggtctgtaacccatatccttcggggcacgacggccaggcgcggggtcagaagggggacgggccgcac 

cgccgggggtcgggggggaagaagacccaccgccaggtgagtcgaagtgagcccggagggtaggcggatggggggggggctgcc 

agggaggggagggggcaccagagtgggagcggagacgcgagcaggtctcgtcggtaacccgggcttaccccacctgcgtacacaca 

cctcagtcttcctgggttgggggggtggggatccaggccaggagaagagagctgtgccccgctggctcgcagctggacgccctccagat 

gtggtcaggggagggtcgtcatcctccagatgtgggaagcttcgggagcctgggagctgtactctgcccgcgccggttagcgagctgggt 

ttggtttccgagtttggtggggggtgggtgggggcggtggggaggaagctgcggggacggaggaggggggaccgcaatctcctgggtt 

tccctccttcccccgccccaaagtttgcggcggattctagatgttggggggcggggaccaggtcctggcccacctcaccccccacctcgcg 

ggttggaggcacaacaaggagattccggcggcggctgatgtcaggggcgcagaatgagaacaagatgtggtggaggggagctgtctgc 

ccccggagctgggagtggagcccctttccgctagagcccagtgccgcgggtgcctcctacccgatctccattcgatgc 
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CDX2 CpG Island (SEQ ID NO: 107) 

ctcgttaatcacggaagccgccggcctggggctccgcacgccagcctgtgcgggtcttccccgcctctgcagcctagtgggaaggaggt 
gggaggaaagaaggaagaaagggagggagggaggaggcaggccagagggagggaccgcctcggaggcagaagagccgcgagg 
agccagcggagcaccgcgggctggggcgcagccacccgccgctcctcgagtcccctcgcccctttcccttcgtgccccccggcagcctc 
cagcgtcggtccccaggcagcatggtgaggtctgctcccggtccctcgccaccatgtacgtgagcta 

EGFR CpG Island (SEQ ID NO: 108) 

gtccgcggggaccgggtccagaggggcagtgctgggaacgcccctctcggaaattaactcctcagggcaccgctcccctcccatgcgcc 

gccccactcccgccggagactaggtcccgcgggggccaccgtgtccaccgcctcgcggccgctggccttgggtccccgctgctggttct 

cctccctcctcctcgcattctcGtcctcctctgctcctcccgatccctcctccgccgcctggtccctcctcctcccgccctgcctcccgcgcctc 

ggcccgcgcgagctagacgtccgggcagcccccggcgcagcgcggccgcagcagcctcctccccccgcacggtgtgagcgcccgcc 

gcgccgaggcggccggagtcccgagctagccccgcggccgccgccgcccagaccggacgacaggccacctcgtcgcgtccgcccg 

agtccccgcctcgccgccaacgccacaaccaccgcgcacggccccctgactccgtccagtattgatcgggagagccggagcgagctctt 

cggggagcagcgatgcgaccctccgggacggccggggcagcgctcctggcgctgctggctgcgctctgccCggcgagtcgggctctg 

Gaggaaaagaaaggtaagggcgtgtctcgcggctccccgccgcccccggatcgcgccccggaccccgcagcccgcccaaccgcacc 

gcgcaccggcttcgcccgcgcccccgcccgtcctttcctgtttccttgagatcacgtgcgccgccgaccgggaccgcgggaggaacggg 

acgtttcgttcttcggccgggagagtctggggcgggcggaggaggagacgcgtgggacaccgggctgcaggccaggcggggaacgg 

ccgccgggacctccggcgccccgaaccgctcccaactttcttccctcactttccccgcccagctgcgcaggatcggcgtcagtgggcgaa 

agccgggtgctggtgggcgcctggggccggggtcccgcacgggctccccgcgctgtcttcccagggcgcgacggggtcctggcgcgc 

acccgagggccgctgcccacccgccgagactgcctgtttagggaagctgaggaaggaacccaaaaatacagcctccgctcggaccccg 

cgggacaggcggctttctgagaggacctccccgcctccgcgctccgcgcaggtctcaaactgaagccggcgcccgccagcctggcccc 

ggcccctctccaggtccccgcgatcctcgttccccagtgtggagtcgcagcctcgacctgggagctgggagaactcgtctaccaccacctg 

cggctcccggggaggggtggtgctggcggcggttagtttcctcgttggcaaaaggcaggtggggtccgacccgccccttgggcgcagac 

cccggccgctcgcctcgcccggtgcgccctcgtcttgcctatccaagagtgccccccactcccgggaccccagctccctccgcgcccgc 

gccgaaagccccaggctctccttcgatggccgcctcgcggagacgtccgggtctgctccacctgcagcccttcggtcgcgcctgggcttc 

gcggtggagcgggacgcggctgtccggccactgcagggggggatcgcgggactcttgagcggaagccccg 
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FBN1 CpG Island (SEQ ID NO: 109) 



agagccgcgtctggagtgggctctcgacacccagggcaagtgggggcggcagagccctctcctcggtcggcacagcagcctctgccgcggtcccgg 
cctgcgacgcgcccagtcttagcctcccggcctccggcgtctgctgagtgtccggcgggagaggcgcagggagcgcgctaccgggaggcgcgggc 
agcggggactggtmctctcgggccagggcctccggggcaaccgtctccagcgcgcattcttggtgcaggtggaacagctttctgctccggtagggctt 
cacctatcgcgggagaggttaatctcggatctaaacctcgcagccgcagagcgggctaaaaccgctactccacctcttcccatttctcccctccccacctc 
aagacaaaaagtcccaggccgggcaggacctgatcacctctgcctcctcccactgcgctaatcctgcgagcgagaggccccgcaccgaggcggagg 
ctgcaaaggggagtggaaagggaggatggatggggccggggggtggggtggtgatgagggcgacgaaggagggggtgtcatWctttttctttcttttt 
ttaaaaaaagtatttctctcgcgagaaaccgctgcgcggacgatacttgaagaggtggggaaaggagggggctgcgggagccgcggcagagactgtg 
ggtgccacaagcggacaggagccacagctgGgacagctgcgagcggagccgagcagtggctgtagcggccacgactgggagcagccgccgccg 
cctcctcgggagtcggagccgccgcttctccagtgggtgcagccggggtccgacgggggtcgggcggccaccggggctggagctgcggccacgga 
ggcttttgcgtttgcgccgnnngagggcagggacagggactggggtgaggggctgtcccggaacgtccaacgtggncgctggaccctcccctgcct 
gacagcttcctgnccggggcttcttggtgccggnccggcgtcagatgttcggggggcggtgcatcgcccggagtcggcggggacggcgcggctgctt 
ccagctggcggagagggcaggctgaggagtggggcgttcagagcgcgcatcgcgcgcaattcgtgccgcteaaaaaaataaacccagagagctcgc 
ccggggcttaggaccgctggggatatgggtactttgcgccgcgctcttctggcggggcccgggaggcgagggattggccggggctgctgcgccggg 
gcctgggctttccagccagctgtggaccaaacggtcttcccttacccaaattaactgcgccacaggcggccgacnggttgggctttgggaatggggacc 
gcgagcttcagcatcccgatgccctgaaagtctccccgcctcggggatttgtctctgtgttgcagctggcaggggccgcctgaagtgggagcagcgcct 
ggagaaggcgggaggagcccggcccgggggacgggcggcgggatagcgggaccccggcggcgcggtgcgcttcagggcgcagcggcggccg 
cagaccgagccccgggcgcggcaagaggcggcgggagccggtggcggctcggcatcatgcgtcgagggcgtctgctggagatcgccctgggattt 
accgtgcttttagcgtcctacacgagccatggggcggacgccaatttggaggctgggaacgtgaaggaaaccagagccagtcgggccaagagaaga 
ggcggtggaggacacgacgcgcttaaagggtaaaggaaccggttccctc 

GPR37(SEQIDNO:110) 

Tcccgccccgcacccgcccctagcccgggctcggggacctgtcaggctggtttcgacagctggggaattaacctgtcccgcccatccctagcctcgag 
ccgcgcaggctccgcgcctccgcccttgttccctcccagctcctccgagtggaagccgctacaaatggcttgaatgaaacgtgtgtgggtttagtgagtg 
gtgaaccaccaggggatcccgtctccccacaaaccagtatctctccgaggaggaggcgaaggagtgggaggaggcaacgagccgagagtcgagctt 
cgcgggcgcgcgcagcggctggagcgcgggggcgaggccgggccacctccccttcccggccgcgcactgcctggcccgcggcggttccaggca 
ccacccttcccgtccgggctgagcccgctgtggcagtgactagctcccgcggctagcggcactgtccaccgacgagcggcgccctcttctcccccttct 
ccccacgatttccttctctgcggcggcacgccgtccagcagcctgcttcgccccgtcgtcaactttgagctggaggagaagcaactttggcagtggccgc 
ggggttggaatcccgcttctcctcggcagcagtaggctcgcaagtcgctggggttaggtggggcaagagtttcgccggcgcatcagcgctgcttcggac 
tgmgcaacgtgmccagcgagctgggagcggggttgtgactgcgagtcgtctgggggagggggacttgtmcttttcctctagagacctcggcttgca 
actggatcaaacgctgtcgaaa 



HSPA6 CpG Island (SEQ ID NO: 111) 

tgtattcgcatggtaacatatcttcggtcttcctgccgctgggctctcagcggccctccaaggcagcccgcaggcccgtgctcgcctcagggatcctccac 
agccccggggagaccttgcctctaaagttgctgcttttgcagctctgccacaaccgcgcgtcctcagagccagccgggaggagctagaaccttccccgc 
gtttctttcagcagccctgagtcagaggcgggctggccttgcaagtagccgcccagccttcttcggtctcacggaccgatccgcccgaaccttctcccgg 
ggtcagcgccgcgctgcgccgcccggctgactcagcccgggcgggcgggcgggaggctctcgactgggcgggaaggtgcgggaaggttcgcgg 
cggcggggtcggggaggtgcaaaaggatgaaaagcccgtggacggagctgagcagatccggccgggctggcggcagagaaaccgcagggagag 
cctcactgctgagcgcccctcgacgcgggcggcagcagcctccgtggcctccagcatccgacaa 
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IQGAP2 CpG Island (SEQ ID N0:1 12) 

Agagttcacttttacttcagtgtcagcgcgcggcggccgtggctggctctggcgagagagcaccgagggagtgggtcgcagatcttcgg 
gcggctaggggaaatcggcgagaggcgggatccgagcgcgccggcggggcgcagagcccgcgagcctggccagcgagggtagcc 
gcggggggcgcgccccgggcgggcccccggagacgcgcaggatgccacacgaagagctgccgtcgctgcagagaccccgctatgg 
ctctattgtggacgatgaaa 

KL CpG Island (SEQ ID NO:l 13) 

ctcgaaagaggggcgcgggtgggcgcgtctccccgcgagcatctcacctaagggggaatccctttcagcgcacggcgaagttccccctc 

ggctgtcccacctggcagtccctctaggatttcggccagtccctaattggctccagcaatgtccagccggagcttctttgggcctccgagtgg 

gagaaaagtgagagcaggtgcttccccagcggcgcgctccgctagggcccggcaggatcccgcccccaagtcgggaaaagttggtcg 

gcgccttttctccccgacgaagccgctccagggctgctctcagaggacgcgcggcaggcaaagagaatgaacctgagcgtccacgaaac 

gtcctgcacggctcccgggagctgggagaaacaggtgcctttctccgacgtccgcgggcgacgcctgccgcaccttgcccgctgccgcg 

cccctcccgggcacccctcgccctcggcgcccctgcccccacccccagtgccagggcggaggcagtcccggctcgcaggtaattattgc 

cagcggagcccgccggggagcgggggtgggcgcgccggcggtgggcgGgcgggcgcggcggggcgcgggcataaaggggcgc 

ggcgcggggccccggagcctggctcccgcgcagcatgcccgccagcgccccgccgcgccgcccgcggccgccgccgccgtcgctgt 

cgctgctgctggtgctgctgggcctgggcggccgccgcctgcgtgcggagccgggcgacggcgcgcagacctgggcccgtttctcgcg 

gcctcctgcccccgaggccgcgggcctcttccagggcaccttccccgacggcttcctctgggccgtgggcagcgccgcctaccagaccg 

agggcggctggcagcagcacggcaagggtgcgtccatctgggacacgttcacccaccaccccctggcacccccgggagactcccgga 

acgccagtctgccgttgggcgccccgtcgccgctgcagcccgccaccggggacgtagccagcgacagctacaacaacgtcttccgcga 

cacggaggcgctgcgcgagctcggggtcactcactaccgcttctccatctcgtgggcgcgagtgctccccaatggcagcgcgggcgtcc 

ccaaccgcgaggggctgcgctactaccggcgcctgctggagcggctgcgggagctgggcgtgcagcccgtggtcaccctgtaccactg 

ggacctgccccagcgcctgcaggacgcctacggcggctgggccaaccgcgccctggccgaccacttcagggattacgcggagctctgc 

ttccgccacttcggcggtcaggtcaagtactggatcaccatcgacaacccctacgtggtggcctggcacggctacgccaccgggcgcctg 

gcccccggcatccggggcagcccgcggctcgggtacctggtggcgcacaa 

PAR2 CpG Island (SEQ ID N0:1 14) 

Cccggggcgtggcctcccgcaggtgagtacgctgctccttcggtttccctgaaacctaacccgccctggggaggcgcgcagcagaggct 
ccgattcggggcaggtgagaggctgactttctctcggtgcgtccagtggagctctgagtttcgaatcggcggcggcggattccccgcgcgc 
ccggcgtcggggcttccaggaggatgcggagccccagcgcggcgtggctgctgggggccgccatcctgc 
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PITX2 CpG Island (SEQ ID NO:l 15) 



agtccgtgctcctgctcctcggttggctcctaagtgccccgccaggte 

ccctgcctcgtttctcgtcgcccctgctcgctccccccggcgctcgcccgggcgctgtgctcgctcctggatcgccagccgcgcagccgggctcggccggccgcccg 
cgcgccactgtgcagtggagtttggtggaatctctgctgacgtcacgtcactccccacacggagtaggagcagagggaagagagagggatgagagggagggagag 
gagagagagtgcgagaccgagcgagaaagctggagaggagcagaaagaaactgccagtggcggctagatttcggaggccccagtgcacccgtggactccttcgga 
acttggcaccctcaggagccctgcagtcctctcaggcccggctttcgggcgcttgccgtgcagccggaggctcggctcgctggaaatcgccccgggaagcagtggga 
cgcggagacagcagctctctcccggtagccgataacggggaaatggagaccaactgccgcaaactggtgtcggcgtgtctgcaattagagaaagataaaagccagca 
ggggaagaatgaggacgtgggcgccgaggacccgtctaagaagaagcggcaaaggcggcagcgga 

PTCA CpG Island (SEQ ID NO: 1 16) 

GCGGCCGCAGCGGCAGCAGCGCCCGCCGTGTGAGCAGCAGCAGCGGCTGGTCTGTCAACCGGAGCCC 

GAGCCCGAGCAGCCTGCGGCCAGCAGCGTCCTCGCAAGCCGAGCGCCCAGGCGCGCCAGGAGCCCGC 

AGCAGCGGCAGCAGCGCGCCGGGCCGCCCGGGAAGCCTCCGTCCCCGCGGCGGCGGCGGCGGCGGCG 

GCAACATGGCCTCGGCTGGTAACGCCGCCGAGCCCCAGGACCGCGGCGGCGGCGGCAGCGGCTGTAT 

CGGTGCCCCGGGACGGCCGGCTGGAGGCGGGAGGCGCAGACGGACGGGGGGGCTGCGCCGTGCTGCC 

GCGCCGGACCGGGACTATCTGCACCGGCCCAGCTACTGCGACGCCGCCTTC 

CCANGGNNGGCATTTCAGACT>mmm^ 

CCCCCGCCACACACAAACACACACACTlSrrCITCCrCTNTOT 

ACNTCTNCAGGAAAAGCAGCAGACAAATGGGGATTC 

GGAAAGGGCTGTCTGAGGTCCGCAGGGGGTGGAGGTGTGTGTGTGTGCGTGTGTGTGTGTGNANACAC 
ACGCCCTCCCTGGTGTGCCTTTTCCGGAGCACTGGAAAGCCGTCCACGGCGGACCACCTCAAGGGCGG 
CCGC 



GCGGTCGTAGCGGTAGTAGCGTTCGTCGTGTGAGTAGTAGTAGCGGTTGGTTTGTTAATCGGAGTTCG 

AGTTCGAGTAGTTTGCGGTTAGTAGCGTTTTCGTAAGTCGAGCGTTTAGGCGCGTTAGGAGTTCGTAGT 

AGCGGTAGTAGCGCGTCGGGTCGTTCGGGAAGTTTTCGTTTTCGCGGCGGCGGCGGCGGCGGCGGTAA 

TATGGTTTCGGTTGGTAACGTCGTCGAGTTTTAGGATCGCGGCGGCGGCGGTAGCGGTTGTATCGGTGT 

TTCGGGACGGTCGGTTGGAGGCGGGAGGCGTAGACGGACGGGGGGGTTGCGTCGTGTTGTCGCGTCG 

GATCGGGATTATTTGTATCGGTTTAGTTATTGCGACGTCGTTITCGNTTWGGAGNAGAT^ 

NGGTATTTTAGATTNTNTTNTTTTTATTTTNTTTT^ 

TATATAAATATATATATTNTITTTTTTTNTNTTTTATATATAATA^ 

AAAAGTAGTAGATAAATGGGGATTGAAAAATTTAAATTTTTTTTTTGGTNOT 

TTGAGGTTCGTAGGGGGTGGAGGTGTGTGTGTGTGCGTGTGTGTGTGTGNANATATACGTTTTTTTTGG 
TGTGTTTTTTTCGGAGTATTGGAAAGTCGTTTACGGCGGATTATTTTAAGGGCGGTCGT 



PTCHB CpG Island (SEQ ID NO: 1 17) 

GCGGCCGCGGCACTGTCCTGCCCCGTGCCCCCTGCCCTGAACTTCTTCCTCCTGCGCCCCTGCCCCTAT 

TTGCAGCCTAAACTCCTGTACGGCTGCCACATTTCTTAACATCTTGGAGGGGGAGGCGGAGTGGAGAG 

AGGCGGAGAGAGGAAGGGGGGAGGGAGCCGAAATAAAGGTGGTTTCCTTTTITGGCAGCCAGTTTTG 

GTTTTGTTGAGCATGAAATCTCTGCTCCCTrAAAAAATTATTCTCGGAAAAAGATATCCCCCCCGTT^ 

CCAGGTTTTGAGCCGCCTCTCCTTAGGGCCTGGTCGGGGGAGGAAAAGTTGTAAACAAATTGCCACCT 

TAAATTCGCGGTGCGANTCTGCGGAGCTGCCGGGTTCATTGTGTTTACGAGGCTCGCTGAAATGTGTG 

GAATCCAGGGAAGGCGAGCACCCAGACGGGGGCCCGCCGGGGCCGCGGCCAGCGCCGGGGAAATGC 

CGCGCCGGGGAGCAGCATGCGCCGGCCTGAGCCCTTCCCITTGCACTCGGCTGTTTTTTACGTTTAACC 

AGAAAGGAAGGGAGAGGAGGGAAAGATCCATGTGGCTGCCCTCTTCCGATCACAAATATTGTCGTAA 

GTTGCAGCTGGCrGCCCCANTTCCTAATTCAGCTCACACAGCNTNTCCCCACGCTATGGAAATGCGTCG 

GGAGTGAACTCCGGCGGCCGC 



FIGURE 4F 



SDC1 CpG Island (SEQ ID NO:l 18) 

Ggagaggtgcgggccgaatccgagccgagcgagaggaatccggcagtagagagcggactccagccggcggaccctgcagccctcg 
cctgggacagcggcgcgctgggcaggcgcccaagagagcatcgagcagcggaacccgcgaagccggcccgcagccgcgacccgc 
gcagcctgccgctctcccgccgccggtccgggcagcatgaggcgcgcggcgctctggctctggctgtgcgcgctggcgctgagcctgc 
agctggccctgccgca 



SDC4 CpG Island (SEQ ID NO:l 19 ) 

Agtaggagccggcgggctcgggcagggcgggtcccttggggtttccaactccgcgggcgggcgcagtgccccgcaggcctcgcttcc 
actggggaattccgggcggggtgcgggcggcggggcgggggcgggccggggcggggccggtaggccgcctataagatgggtggcg 
cgcccgcccggggccactcgccgcagcctgcgcgccttctccagtccgcggtgccatggcccccgcccgtctgttcgcgctgctgctgtt 
cttcgtaggcggagtcgccgagtcggtgggtgcttggaggttcccgggctgggggcgaagcgggggcgcaggccggtgcctcctttgtt 
cgtcggagcgtgggatgggggggggcagatcgggggtacgctacccccaaccggacaccgaggcccgggaaactttgttggaaacttt 
gctccggggtcacgggccagcctccgggatggcttcacgcgccgtgcgcccctcgcctgttgctcttcccgcctccccgggcctcagccc 
cgccgcgggctacgggctcgttagtgactaagccggtgtcaactcttcaactcccacaccctcgtcccttccctggtgaccctggggcagg 
cttggagcgctgaatcccctcctcgctctcggggcgcccagagcagacagctttaggatccgagatggccctgggggtcggggggctgc 
gtgtactcggaagggggagggttttagggttgtgcgaggccc 



FIGURE 5 



MINT31 (SEQ ID NO: 120) 
CCCGGGGCCT CTATCCTGGC GGGAAGGGCA 
AGGGAAGAAG GTGTCAGACG CGCGGAGCAA 
GCACGGGGTT CAAGACTCAG GCGCCGCATA 
AAAAAAGGGC ACTTAGGGGA TCTGCTCATT 
TCATTTACAC AACTCTGTGC ATGGATTCGG 
GCAGCCACTC TGCCACTTCC CCCATCCCCT 
CGACCCCAAC CCTCTGCGGC GGCGGGAGGT 
AGGGGAGTGA ACAACGCCCA GGGGTGATGG 
TCATCCGACC CATCCCGCCG CCTCTCCGCA 
NGGAACTGAG TGACCCACTA GGTTCCGCCG 
CTTCCCGGCG TCTAATCCTC GCTGTTCCCC 
CTCCGCTCCC GGG 



GGCCGACCCG GCAGACTGCG GCCTCTCGGG 
CCATAAATAG CCCCCCTTTC CCAGAAGACG 
CTCAGAATGA GAGCAGAGAC TCCCGCCAGG 
AACATGAAAT GCAAATGAGC CCGCCCGGCC 
CGAAAGGGCA ACCAGGGAGA CGACGGCGCA 
CCCCCCCATC GGCCGGGGCG GGAACTGAGA 
GCGCGGGGGC TGCGTGGGTG GTGCAGCCTT 
CCTCAGCAAA GTGAGGGGTG GTGATGGAGG 
GTGGCGCAAG CGCCCCAAAA TCTCCGGAGA 
TGTCTACCTC TCGCAGATGT TGGGGAAGTG 
CCTCCACCGG CGCCCAGCAC ACCCGCGGCG 
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